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Apparently there are those who do not subscribe 


to an all-Microsoft approach. 


How can we be surprised? With all the mainframes, 
clients, application servers, databases and who 
knows what else you guys run in your enterprise, It 


would surprise us if any of you did. 


That’s why we're working on making all technologies 
(both new and old) work together in one interoperable 
computing environment. For example, Microsoft’ 
Windows NT’ and Microsoft Visual Studio develooment 
system enable developers to build solutions that 
integrate data from mainframe and UNIX server 
alike. It’s just a first step in building solutions that 
incorporate all the pieces of your enterprise puzzle. 
Because we know that, in the end, the only approach 
youre going to subscribe to is your own. 


To learn more go to www.microsoft.com/msdn 
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DR. DOBB’S JOURNAL EXCELLENCE IN PROGRAMMING AWARDS 20 
by Jonathan Erickson 

The “Gang of Four’—Richard Helm, Erich Gamma, Ralph Johnson, and John Vlissides — 

are recipients of this year’s annual award that honors achievement in the world of 

software development. 


SOFTWARE QUALITY 22 
by Shari Lawrence Pfleeger 

The metrics Shari describes here help you better understand code, control testing, and 

predict faults and failures. 


BENCHMARKS: FACT, FICTION, OR FANTASY? 30 
by Robert R. Collins 

Can you really trust published benchmarks? By making a 166-MHz Pentium computer 

seem to outperform a 300-MHz Pentium II system, Robert shows why healthy skepticism 

is a useful trait. Brian Butler then presents a sample database benchmark. 


TRACING BSD SYSTEM CALLS 38 


by Sean Eric Fagan 
How do you debug a program that doesn’t have source or debugging symbols? One 


) « 


way is to watch the system calls it makes. Sean’s “truss” utility lets you do exactly this. 


A REAL-TIME PERFORMANCE VISUALIZER FOR JAVA 4a 
by John J. Barton and John Whaley 

Find out where the performance bottlenecks in your Java programs are with the JVM 

performance visualizer presented here. 


picoPERC: A SMALL-FOOTPRINT DIALECT OF JAVA 50 
by Kelvin Nilsen 

picoPERC is a Java subset in which the core Virtual Machine implementation fits in less 

than 64 KB of memory. This 64-KB footprint is nearly 1/16th the size of JavaSoft’s yet-to- 
be-defined Embedded Java and over 50 times smaller than typical Enterprise Java 

implementations. 


VERCHECK: DISCOVERING COMPONENT VERSION NUMBERS 99 
by John Graham-Cumming 

VerCheck, the utility John presents here, gives you a list of the versions of all the 

components of relevance to your program. 





BUILDING INTELLIGENT WEB-BASED CONTROL SYSTEMS 36 
by Tom Milligan and Steve Coffin 

Our authors shows how to embed web servers onto embedded devices and develop 

web-based user interfaces. To illustrate, they automate a sprinkler system using the 

Embedded Micro Interface Technology toolkit. 


TRANSPARENT ATL CONTROLS N2 
by Tom Armstrong and Mark Nelson 

Tom and Mark use Microsoft's Active Template Library to build an ActiveX control that 

displays a bitmap with a single transparent color. You can then use the control with 

Internet Explorer, Netscape Navigator, Visual Basic, and most other ActiveX containers. 
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SOURCE-CODE PROFILERS FOR WIN32 

by Ron van der Wal 

Ron examines profiling tools that target Win32 C/C++ development. These tools include 
Intel’s VIune, Microsoft’s Visual C++ 5.0 profiling tools, Rational’s Visual Quantify, 
TracePoint’s HiProf, Watcom’s C++ 11.0 tools, and those that come with the Win32 SDK. 


WINDOWS NT DEVICE DRIVER TOOLKITS 

by Patrick Tennberg 

Writing device drivers in C using the Windows NT Device Driver Kit can be scary. 
Patrick examines alternative toolkits, such as BlueWater Systems’ WinDK and Vireo 
Software’s Driver::Works. 
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PROGRAMMING PARADIGMS 

by Michael Swaine 

Michael doesn’t do benchmarks. Well, he does, but he doesn’t like to. What he really 
likes to do is read Dilbert— and he takes issue with those who don't. 


C PROGRAMMING 

by Al Stevens 

Al launches a new project this month— the C++ Persistent Template Library, for adding 
persistence to containers. 


JAVA Q&A 

by Govind Seshadri 

Java’s Remote Method Invocation brings distributed-object computing to Java. Govind 
examines the intricacies of enabling true peer-to-peer Java RMI interaction. He then 
presents a step-by-step approach to implementing callbacks. Cliff Berg will return 
next month. 


ALGORITHM ALLEY 

by William Stallings 

Many programs need to predict the behavior of external systems. William shows how 
exponential smoothing fills the bill for a variety of applications. 


UNDOCUMENTED CORNER 

by Robert L. Collins 

Robert continues his discussion of the Pentium’s Virtual Mode Extensions, starting with a 
description of the various components of VME and how they work together. 


PROGRAMMER’S BOOKSHELF 

by Jeff Cromwell 

If you need to learn more about ISAPI, the books Jeff examines here may be just the 
place to start. These books include Using ISAPI, by Stephen Genusa et al.; Professional 
Visual C++ ISAPI Programming, by Michael Tracy; and Programming ISAPI with Visual 
Basic 5, by Wayne S. Freeze and Tim Ritchie. 
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EDITORIAL | 6 
by Jonathan Erickson 

LETTERS 8 
by you 

NEWS & VIEWS 18 
by the DDJ staff 

OF INTEREST 145 
by Eugene Eric Kim 

SWAINE’S FLAMES 152 
by Michael Swaine 


RESOURCE CENTER 


As a service to our readers, source code (and related 


- files), back-referenced articles, and relevant links 


are available electronically at this month’s online 
Table of Contents at http://www.ddj.com/. Source 
code is also available via anonymous FTP from 
ftp.ddj.com (199.125.85.76), the DD/ Forum on 
CompuServe (type GO DDJ), and DDJ Online (650- 
358-8857, 14.4 kbps, 8-N-1). Source-code diskettes 
can be ordered ($14.95, California residents add 
sales tax) by mail, fax (650-358-9749), or phone 
(650-655-4100 x5701). Letters to the editor and article 
proposals/submissions should be mailed or faxed 
to the DDJ office or sent electronically to 
editors@ddj.com. Author guidelines are available 
at http://www.ddj.com/. Send inquiries or requests 
to Dr. Dobb’s Journal, 411 Borel Ave., San Mateo, 
CA 94402. For subscription questions Gncluding 
change of address), call 800-456-1215 (U.S. and 
Canada); other countries, call 303-678-0439 or 
fax 303-661-1885. E-mail subscription questions to 
71572.341@compuserve.com or write to Dr. Dobb's 
Journal, P.O. Box 56188, Boulder, CO 80322-6188. 


NEXT MONTH 


April showers bring algorithms, and in this 
issue, we will focus on algorithm design. 
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Speed is essential in all database projects, but not at the exp 

wouldn't try to go 100 miles per hour with your bicycle! The same is true in database 
technology. FairCom has been delivering fast, safe, fullfeatured database engines to 
the commercial marketplace for 19 years. Proven on large Unix servers and 
workstations, c-tree Plus’s small footprint and exceptional performance has also 
made it the engine of choice for serious commercial developers on Windows and 
Mac. Check out www.faircom.com for detailed information. You'll be glad you did. 


c-tree Plus® key features for $895: 
- Royalty Free - Roll-forwards /Roll-backwards 
Portable Multi-Threaded API Easy make system 

Complete C Source Advanced Variable Length Records 








- Thread Safe Libraries - BLOBS 

- Standalone or Client/Server - Space Management 

- Complete Transaction Processing, - File Level Security 

including automatic recovery - Conditional Index 
a : - Save-points - ODBC/Java Interfaces 

Platforms: Abort/Commit Over 25 Developer Servers included 
MIPS ABI DEC Alpha Sun SPARC Windows 95 SCO Banyan VINES 
880PEN OSF/ 1 DOS Windows NT LINUX (Alpha/Sparc/inte) QNX 
AIX HPSOO00 OS/2 Windows 3.1 AT&T System V Chorus 
RS/6000 Sun OS Mac Interactive Unix Netware NLM Lynx 
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Half of your Client/Server project is the Server! You control 100% of your Client 
Side. Why settle for less on your Server side? Move your functions to the server- 
side to decrease network traffic and increase performance! 


Today's database demands may often be too complex for traditional Relational Model 
Database Servers. Server needs come in many different sizes and shapes. What 
better way to accommodate these requirements than by allowing the developer to 
take full control of the Server side? FairCom’s Server Development System was 
created to meet this need. It provides the developer the means to create an 
industrial strength Server. Complete make-files are included for all FairCom 
commercial platforms. With our proven kernel add or override existing database 
functionality or create your own special multi-threaded server: 


Application Server Network Gateway Server 
special Web Server Departmental Database Server 





Data Warehouse 
Embedded Servers 


FairCom Server Development System key features: 
Provides complete source code for all the interface subsystems to the FairCom 
Server. Server mainline, Communication, Threading, Remote function interfaces and 
procedure calls are all supplied in complete C source code together with the 


FairCom Server sophisticated thread-safe kernel libraries. 


Customizable Rollback-Forward Data History Conditional Index 
Transaction Processing Anti-Deadlock Resolution Multiple Protocols Small Memory Footprint 
Online Backup Client Side Source Heterogeneous Networking OEM pricing 

Disaster Recovery Multithreading File Mirroring ODBC/Java interface 
Key level locking 











FairCGony 
Corporat erie 


Commercial Database Technology. Since 1979. 


®° USA. 800.234.8180 


Phone: USA 573.445.6833 - EUROPE +39.35.773.464 - JAPAN +81.0592.29.7504 - BRAZIL +55.14.224.1610 
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We’ve just changed the way 
you'll see Visual C++ forever! 


Stingray Software presents Visual CASE, the first Object 
Oriented Design tool you will actually use. With Visual Case 
you'll see your code in a whole new way - graphically. Unlike 
other CASE tools that try to be all things to all languages, Visual 
CASE was designed specifically for Microsoft Visual C++. 
This means it completely understands MFC, COM, Activex 
and ATL. Now you can have true round trip engineering allow- 
ing you to update your application either by manipulating the 
graphical model or your source code. No longer does Object 
Oriented Design have to be a pipe dream that doesn’t work in 
the real world. 








Sound too good to be true? Download a FREE fully functional 
copy today and — Get Visual! 





Visual On kys ie eer See what you’ve loYsrsyal missing. aoe 
919-461-0672 


WWW. S t : Nn Gg " rs | y mOnek sal aoe sales @stingray.com 


AD LINK 286 














Let Us 
Now Praise 
Famous Stuff 





ULES 





closets. The automobile industry, for instance, can boast that the first car was manufactured in 

1896 by the Duryea Motor Car Company—with the first automobile accident occurring shortly 
thereafter, when one of those cars became one with a bicycle. This was followed by the first 
stoplight in Detroit in 1914, the first car radio in 1930 (and the melding of “motion” and “radio” 
into “Motorola”), and, in 1958, the Edsel. As for the movie biz, who can forget D.W. Griffith’s 1915 
Birth of a Nation, Orson Wells’ famous 1941 Citizen Kane ceiling shot, or Joe Eszterhas and Paul 
Verhoeven’s 1995 Showgirls. 

The computer industry has its share of milestones, too. Recently celebrating its 50th birthday, 
the transistor was developed by physicists John Bardeen, Walter Brattain, and William Shockley at 
AT&T Bell Labs in December, 1947. (The trio subsequently received the 1956 Nobel Prize for their 
invention.) A few years later, Fairchild’s Robert Noyce and TI’s Jack Kilby independently figured 
out how to put transistors and other components onto silicon, launching us into the 
semiconductor age. Although originally designed to control and amplify electrical signals in long- 
distance telephone calls, transistors are today found everywhere, fueling an estimated $866 billion 
high-tech industry employing more than 4.25 million people in the U.S alone. 

So how important are transistors? Well, if it weren’t for semiconductors, we’d never have had 
Donkey Kong, Space Invaders, or Super Mario Bros., let alone Myst and Riven. Yes, it’s hard to 
believe, but video games have passed a milestone, too, celebrating their 35th birthday. Today’s 
video games are, of course, a far cry from Spacewar, the first video game, written by MIT student 
Steven Russell (with help from Pete Sampson and Dan Edwards) in 1962. (Yes, there are those 
who would argue that “the” first video game was Brookhaven National Lab’s William 
Higinbotham’s 1958 “Tennis for Two,” which was played on an oscilloscope. Okay, so it’s really 
the 40th anniversary of video games. I can live with that.) Written to run on a DEC PDP-1 (one of 
the first computers sporting a video display), two players played Spacewar by flipping toggle 
switches to maneuver spaceships. One of the few people at the time who had access to a PDP-1 
and a penchant for playing video games was, of course, University of Utah student Nolan 
Bushnell, who eventually founded Atari to market Pong and a ton of other games. As for 
Spacewar, DEC ended up using it as a tool for testing its computers, in much the same way Flight 
Simulator was later used to gauge Windows compatibility. 

To celebrate the history of video games, by the way, the SciTrek Science and Technology 
Museum (Atlanta, Georgia) has put together a roadshow called the “Cyber Playground” that traces 
the evolution of video games. The exhibit features working arcade games such as Sprint (1976), 
Asteroids (1979), and Battlezone (1980), along with information on computer history, computer 
graphics, and the like. For more information on Cyber Playground, see http://www.scitrek.org/. 

One milestone that certainly shouldn’t be missed is the 20th anniversary of Brian Kernighan 
and Dennis Ritchie’s 1978 C Programming Language. A second edition (published in 1988) 
covered C as defined by the ANSI Standard. C Programming Language is a no-nonsense book— 
written by the guys who created the language in the first place— that took a powerful 
programming language into the mainstream. In doing so, C Programming Language set a 
standard for programming books in terms of completeness, conciseness, and accuracy that’s rarely 
been matched over the past 20 years. 

Just to keep things in perspective, the March 1978 issue of Dr. Dobb’s Journal, led by 
Haphazard Editor Jim Warren and Actual Editor Tom Williams (hey, I’m not making this stuff up) 
included a status update on the UCSD Pascal Project; “Renumbering and Appending BASIC 
Programs on the Apple II Computer,” by Steve Wozniak; “STRUBAL: A Structured BASIC 
Compiler,” by Robert Grappel; “A Z-80 Tracer,” by Arthur Cline; and “ACT: An 8080 
Macroprocessor,” by Alex Cecil. 

Finally, it was 15 years ago that Richard Stallman, frustrated by the program editors available 
for LISP at the time, sat down and wrote Emacs— a program that more or less introduced the 
notion of “free software” (that’s “free” as in “freedom,” not “price,” as Stallman would be quick to 
point out). Emacs led to the 1983 GNU project, which was conceived to foster a cooperative spirit 
among software developers. Over the past 15 years, free software has spawned a raft of software 
(most notably Linux, Perl, and the like) and a slew of companies supporting it (Cygnus, Red Hat, 
Slackware, O'Reilly, and others). 

We've come a long way over the years— but the fun will be in seeing where we go from here. 


Poulin 


Jonathan Erickson 
editor-in-chief 


F industry has its scrap heap of legends, milestones, and skeletons hanging in corporate 
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Making software work together ™ 


Making software work together - it's that simple. No matter how or where your systems are implemented, 
OrbixOTM will bring them together. Mainframe to Windows, Unix to Web - application integration has never 
been easier. Which allows you to focus on what really matters - using the best technology in the right place. 
And OrbixOTM lets you evolve your systems over time, adding new ideas to the reliable solutions that your 


business is built on. 


But OrbixOTM does more than just integrate. It provides a complete enterprise solution. It features extensive 
and natural support for transactions, security, and systems management. There's advanced deployment 
support, load balancing and fault resilience. Which all means distributed object technology is now a long way 


from the lab. It's providing competitive advantage in the real world. 


But then chances are your competition already knows that 





Making software work together™ 


IONA Technologies at 1-800 orbix4u  info@iona.com www.iona.com - Technologies 
Dublin Boston Perth San Mateo Hong Kong Frankfurt Washington DC London 


‘Orbix’ is a registered trademark of IONA Technologies PLC. ‘Making software work together’ is a trademark of JONA Technologies PLC. All other products or services mentioned herein are trademarks of their respective owners. 13290 
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Something in the Air 

Dear DDJ, 

We are writing on behalf of our client, 

Lockheed Martin Corp., to advise you that 

“skunk works” is not a generic term but 

is, rather, Lockheed Martin’s federally reg- 

istered service mark. In March, 1997, Dr. 

Dobb’s Journal published an article by 

Michael Swaine entitled “It Was Not To Be; 

Apple Decides To Acquire Steve Jobs’ 
NeXT Operating System Instead Of The 
Be System,” which made a generic refer- 
ence to the term “skunk works.” Lockheed 
Martin requests that should any future ar- 
ticle in Dr. Dobb’s Journal refer to the term 
“skunk works,” the article acknowledge 
that such term is a registered service mark 
of Lockheed Martin. 
The Skunk Works mark was registered 
by Lockheed Martin on September 18, 
1973, pursuant to Certificate of Registra- 
tion No. 968,861 for “engineering, techni- 
cal, consulting and military aircraft and re- 
lated equipment.” It was reregistered by 
Lockheed Martin on July 14, 1981, pur- 
suant to Certificate of Registration No. 
1,161,482. However, Lockheed Martin has 
made common law use of the Skunk 
Works mark for over 50 years. 
Lockheed Martin’s use of the mark traces 
to 1943. In June of that year, Lockheed 
Martin and its premier aircraft designer, 
Clarence L. “Kelly” Johnson, promised the 
USS. Air Force that in less than six months 
they would deliver a prototype jet fighter. 
Johnson then put together a highly skilled 
and closely guarded group of cutting edge 
engineers and production personnel. Their 
work was performed in absolute secrecy 
and, just 143 days later, they delivered the 
first American jet plane, the XP-80, to the 
Army Air Corps. By January 1944, the XP- 
80 had evolved into the P-80, which was 
America’s first operational jet fighter. 

The Lockheed Martin facility, which de- 
veloped the XP-80 was soon given the col- 
orful nickname “Skunk Works.” The name 
was derived from the popular Zi? Abner 
comic strip, which used the name “skonk 
works” to refer to the place where “Kicka- 
poo Joy Juice” was secretly manufactured. 
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War-time secrecy precautions required that 
the Lockheed Martin engineers not identify 
their office when answering the telephone. 
This reminded one engineer of the isolation 
of “skonk works” in the Zi7 Abner comic 
strip. When a group of Pentagon officers 
placed a call to the Lockheed Martin facili- 
ty, the engineer answered the telephone by 
saying “Skonk Works.” The Lockheed Mar- 
tin “Skunk Works” facilities have consistently 
been creative and successful, and the “Skunk 
Works” name has become identified with 
Lockheed Martin’s unincorporated division, 
which operates those facilities, now known 
as the Lockheed Martin Skunk Works. 
In the half-century since they developed 
the XP-80, the talented Lockheed Martin 
“Skunk Works” personnel have developed 
a continuing series of aircraft which have 
made the United States the unquestioned 
leader in military aviation. Those aircraft 
whose existence is no longer classified in- 
clude: the F-104 Starfighter— the first Mach 
2 fighter aircraft (1954); the U2— an ad- 
vanced reconnaissance and research aircraft 
which is still being flown and which is the 
highest flying single engine airplane (1955); 
the SR-71 Blackbird— the first plane to fly 
at Mach 3 and still the highest flying and 
fastest aircraft ever developed (1964); the 
F-117 Stealth Fighter— the first operational 
aircraft designed for low observability, 
used extensively in Operation Desert 
Storm (1981); and the F-22 Advanced Tac- 
tical Fighter (1992). These and other 
achievements of Lockheed Martin’s “Skunk 
Works” personnel are described more ful- 
ly in a book entitled Lockheed’s Skunk 
Works: The First Fifty Years, by Jay Miller 
(Aerofax, 1993). 

Lockheed Martin cannot permit its valu- 
able mark to become generic and intends 
to take whatever action necessary to enjoin 
misuses of its mark. Lockheed Martin must 
therefore request that should Dr. Dobb’s 
Journal again refer to “skunk works,” the 
reference reflect that the term is a registered 
service mark of Lockheed Martin. 

Adam D. Samuels 
Quinn, Emanuel, Urquhart, & Oliver, LLP 
Los Angeles, California 


DDJ Responds: Thanks for your most in- 
teresting letter, Adam. By the way, as you 
probably know, like “Skunk Works,” the 
term “Lil Abner” is a registered trademark. 
Perhaps as a matter of courtesy to Capp 
Enterprises (the holder of the trademark), 
you should acknowledged that trademark 
in your correspondence. 


More On Ada 

Dear DDJ, 

Readers who enjoyed Gavin Smyth’s article 
“GNAT: The GNU New York University ADA 
Translator” (DDJ, December 1997) might be 


interested in knowing that the GNAT com- 
piler for DOS comes with a free graphics 
package (ftp://cs.nyu.edu/pub/gnat/ez2load 
/vgapck06), which includes an interface to the 
DJPP library and a keyboard interface. 

An SVGA version of this package is freely 
available from http://www. pi.net/~dijklibo/. 
A similar package for the GNAT for the Win- 
dows 95/NT compiler (and also the Aonix 
ObjectAda Compiler) is also freely available 
through http://stad.dsl.nl/~jvandyk/. 

Jerry van Dijk 

jdijk@acm.org 


Recycling PCs 

Dear DDJ, 

I agree with Jonathan Erickson’s in his ed- 
itorial “Giving it the Reboot” (DD/, Novem- 
ber 1997) that we need more options for 
electronics recycling than creating artwork 
and sculpture. 

Readers might also be interested in the 
Electronic Product Recovery and Recycling 
(EPR2) Roundtable, which held its first 
meeting last fall in Washington, DC. A 
summary of the meeting can be found at 
http://www.nsc.org/ehc/epr2.htm. 

Dawn Amore 

Environmental Health Center 

amored@nsc.org 


Server-Side Scripting 
Dear DDJ, 
In “Server-Side Scripting in Visual Basic,” 
(DDJ, September 1997), Al Williams cov- 
ered Microsoft’s ASPs and server-side VB- 
Script. In his summary, Al claimed that 
ASP “certainly goes a long way toward 
making those tools [CGI & ISAPI] obso- 
lete (under IIS, anyway).” The fact is that 
there are tools with the same functional- 
ity for other HTTP servers as well. 
Netscape has a parallel technology 
called “LiveWire Pro,” which is part of 
Netscape’s Commerce Server, or a plug- 
in for it. Furthermore, there is a freely 
available tool for server-side scripting 
called PHP/FI that is accessible from any 
HTTP server that supports CGI scripts (ba- 
sically all), and can also run as a module 
of the Apache web server. For more de- 
tails, check Netscape’s homepage (http:// 
home.netscape.com/) or PHP/FI’s: http:// 
php.iquest.net/. 
Shlomi Fish 
shlomi@slink.co.il 


Stronger Encryption 

Dear DDJ, 

I really enjoyed the interview with Ron 

Rivest (DDJ, November, 1997). It just 

scratched the surface of the cryptographic 

issues we are now faced with. In today’s 

world, a growing need for automatic en- 

cryption of data needs to be implemented. 
(continued on page 12) 
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INTRODUCI 


There’s just been a major evolutionary leap over all lower forms of workstations. Darwin. A full-fledged 
Sun™ workstation (with an up to 300MHZz processor, no less) that lets you run all your favorite PC 
apps. All while delivering the power, scalability, networkability, and proven robust UltrasPARC™/Solaris™ 


performance you've come to expect from Sun. And best of all, for the price of a PC running Microsoft® 
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| _ IN PRICE IN ADVANTAGES | | IN GRAPHICS 
| : CDRS Graphics Benchmark 
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| 64MB/4GB system 64MB/4GB system high-performance graphics | UitraSPARC Ili 300MHz R10000 195MHz 
| 128MB/4GB system 128MB/4GB system 
| Up to 1GB ECC memory | | $42,495 $45,495 
Industry standard dual PCI busses | | | 
$4,299" | | 
: Robust multithreaded | | 66.0 
Solaris operating environment | 
| $2,995* Runs PC productivity | 48.6 
| applications seamlessly (including Doom®) | | 
Internet ready (Netscape™) | 
Java™ technology-enabled | 
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*without monitor 


Windows® NT®-—it starts at just $2,995. (Which, we think you’ll agree, isn’t just evolutionary, but fantas- 


tic.) With Darwin, you can run heavy-duty technical applications one moment, then craft a presentation 
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using Microsoft Office® the next. What’s more, its new Elite3D graphics will blow away a similar SGI 


machine at less than a third of the cost. And since Darwin is binary compatible, it’s a 





perfect entry point to our full line of Sun systems (which, with up to 64 processors, 


can expand to meet anyone’s needs). All ready to run the over 2,000 technical 






applications available for Sun without altering any of them one iota. And last but not least, Sun Ultra 5 


"2,999 


it’s easily networked (remember who's making it). For more information, call 800-SUN-FIND for a Sun 


> SUN 


microsystems 
reseller or representative near you. Or stop by our Web site at sun.com/ult/drd. You'll find that in the 


workstation world, like the real world, it’s survival of the fittest. THE NETWORK IS THE COMPUTER” AD LINK 417 
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Microsystems, the Sun Logo, Solaris, Java, Ultra and The Network Is The Comp uter are trademarks or registered trademarks of Sun Microsystems, Inc. in the United States and other countries 
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(continued from page 8) 

Many users will never learn that all of the 
information they are sending around on net- 
works is not protected. We need encryption 
to become standardized such as data com- 
pression has in the communication field. 

DES or even TRIPLE DES is not the way 
to go. It is long past its shelf life and needs 
to be retired. As soon as this is realized 
we can get on with a better method. Pub- 
lic key is great for e-mail systems, but it 
still has a weakness of key substitution. 
Bruce Schneier discussed this in detail in 
his Applied Cryptography — the weakness 
in many routines in use today. 

As computer power grows, the ability 
to crack the encryption routine becomes 
easier. Electronic voting is a long way in 
the future and it will need even longer for 
the public to believe in it. 

Scott Schwendinger 

sidewinder@sincomm.com 


Inner Loops 
Dear DDJ, 
Although I enjoyed (and agree with) 
Robert Bernecky’s review of Rick Booth’s 
book Inner Loops (““Programmer’s Book- 
shelf,” DD/J, December 1997), I believe he 
missed one of the strongest points Booth 
was trying to make. 

Booth advocates structured assembly- 


OSR Open Systems Resources 
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Expert Tool Kits are also available 
to jump start your projects 


language programming, which is, as he 
says, an excellent tool for revealing weak- 
nesses in design as well as outright bugs 
in the code itself. I have not seen any oth- 
er assembly-language text that encourages 
a reduction in the use of labels and the 
extensive use of MASM’s C-like conditional 
statements. 

I agree with Bernecky that the book 
was unusually well edited, but there are 
certainly more than two typos. One ad- 
ditional correction, at least, can be found 
at Booth’s web site (http://ourworld.com- 
puserve.com/homepages/rbooth/). Un- 
fortunately, the publisher chose a cheap 
bindery method to print this otherwise ex- 
cellently crafted manuscript, and my copy 
broke apart into three pieces on the sec- 
ond day of reading! I had to punch holes 
in it and find a three-ring binder for it. 

I believe programmers in the trenches 
like me need more books of this quality 
(content-wise) instead of the many thou- 
sand-page, gizmo-infested wonderbooks 
that don’t really say a damn thing. 

Arvid R. Hand, Jr. 

Willis, Texas 

bhand@compuserve.com 


Einstein Kudos 
Dear DDJ, 
In his July 1997 DDJ “Editorial,” Jonathan 


@ DEVELOPING NT FILE SYSTEMS — 5 day course 

@ NT KERNEL MODE DEVICE DRIVERS — 3 day course 
Palo Alto, CA starts 3/9/98 * Munich starts 4/6/98 

Nashua starts 4/27/98 * Tokyo starts 5/1 1/98 


CUSTOM DEVELOPMENT and CONSULTING 


Cutting edge software and on-time delivery — guaranteed! 


OSR + Open Systems Resources, Inc. 


CUSTOMIZED DEVELOPMENT, CONSULTING AND TRAINING 


105 Rte. 1O1A, Suite 19 +» Amherst, NH 03031 USA 
voice: +] (603) 595-6500 . 


e-mail: info@osr.com « web: www.osr.com 


fax: +1 (603) 595-6503 


888-677-4264 
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Erickson refers to Nodland and Ralston’s 
research that “...brought into question Ein- 
stein’s theories concerning the constant 
speed of light in a vacuum.” 

The constancy of the speed of light was 
not one of Einstein’s theories. That is a fact 
about the nature of the universe that was 
demonstrated empirically by actual mea- 
surements, in the now-classic Michaelson- 
Morley experiments. 

Einstein’s genius was in determining, 
and his contribution lay in the results 
achieved by his demonstrating, the in- 
evitable consequences of that fact. These 
are too numerous to mention in a brief 
letter, but include both the theoretical (ex- 
plaining the precession of Mercury, and 
the gravitational bending of light) and the 
practical (the development of the atomic 
bomb) and everything in between. 

Now, Nodland and Ralston may or 
may not have shown that light speed re- 
ally is not constant, but that has noth- 
ing to do with Einstein’s work. His The- 
ory of Relativity still stands as one of the 
two most successful physical theories of 
all time (the other one is Quantum Me- 
chanics). 

Howard Mark 

Suffern, New York 
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If you have an imaging project in front of you, choose the #1 toolkit in the market. 
LEADTOOLS is an award winning imaging toolkit that puts more than 7 years of 
development and millions of lines of code at your finger tios. You will save countless 


hours and have access to the best imaging technology available. 


Imaging 


technology that Microsoft, Hewlett Packard, Corel, Xerox and thousands of others have 
chosen for their products. With LEADTOOLS, your project is almost done! 


VIEWING & SPECIAL EFFECTS 


LEADTOOLS provides optimized rendering 
of any image fo all display devices with 
monitor calibrations, auto-dithering, 
scaling, Zooming, scrolling, and animation. 
Choose from 113 Paint Effects, 64 Dissolve 
Effects, and 24 Transition Effects, including 
pushes, pulls, wipes, splits, blinds, Crushes, 
rolls, circulars, diagonals, stretches and 
many more with delays, grain sizes, pattern 
brushes (up to 64 passes), a colored wand 
and a transparent color. The combinations 
are unlimited. Other Special Effects 
include: 3D Shapes, 3D Rotated Text, 3D 
Frames, Gradient and Pattern Filled shapes 
and much more. 


INTERNET/INTRANET 


LEADTOOLS features a Net Aware Activex 
and a Netscape plug-in for 
Internet/Intranet applications, including a 
Bitmap Datapath allowing images to be 
read from any URL, Progressive JPEG, 
Progressive CMP and support for GIF 
interlace, transparency, animation, and 
embedded text. A FeedLoad function has 
been created to allow image data to be 
displayed as it is being transmitted across 
the net. 


DATABASE 


LEADTOOLS has specific features designed 
for the imaging database developer: VB 
data binding, 32-bit ODBC, a customized 


SCANNING 


LEADTOOLS supports high soeed scanning 
using ISIS and TWAIN. TWAIN includes both 
16 & 32 bit native and buffered RAM 
transfer modes. 


PRINTING 


LEADTOOLS performs all image processing 
necessary to print directly to any Windows 
supported printer, with the ability to print 
text and multiple images on the same 
page. 


COMPRESSION 


LEADTOOLS offers more compression 
options than any other toolkit on the 
market, in both standard and proprietary 
formats. 


THE POWER OF LEADTOOLS 


LEADTOOLS is a collection of more than 
400 functions, properties and methods 
that provide low level functions for 
complete control, all the way to the 
highest level functions for ease of use. 
Imaging Common Dialog Boxes, (new to v. 
9.0) greatly simplifies integration. Royalty 
free and available as 16 & 32 bit DLLs, 16 & 
32 bit ActivexXs, and a VBX and includes 
extensive source code examples for 
Microsoft Visual C/C++, Borland C/C++, 
Microsoft Visual Basic, Borland 
Delphi,Microsoft Visual FoxPro, Microsoft 
Access, VB and Java Script. 











“LEADTOOLS is an indispensable 
tool, we used it in the development of 
our FrontPage application". 


Tom Button 

Director of Marketing, 

Internet Platform & Tools division 
Microsoft 


"The file format support is phenome- 
nal, LEADTOOLS gave Micrografx's 
applications (like CreataCard™ and 
Picture Publisher™) the flexibility to 
import and export a wide variety of 
established and new file formats with 
ease. Their format implementations 
are complete and performance Great!" 





OLE 2.0 in-place server, Load/Save 

memory, and Load/Save file offset. NOW F eS LEADTOOLS IMAGING Andy Cohen 
eee 22888 Director Software Development 

LEADTOOLS is backed up by a 30-day money back 72888 Consiinver Products 

guarntee (US & Canada only) and FREE technical support is | IALOGS Veponrahe 


available via phone, fax, Internet, CompuServe or BBS. 


FlashPix™ MODULE 


Integrate the full FlashPix experience. 

¢ FlashPix viewing transforms. 

¢ FlashPix non-image data. 

¢ FlashPix thumbnails. 

* Load and Save portions of an image. 


CUSTOMIZE NOW 


CUSTOMIZE NOW 


VIDEO MODULE 


Integrate extended multimedia capabilities. 
¢ Motion video editing. 

¢ Record, Create, Edit, Play and Save AVI files. 
¢ Record, Play, and save WAV files. 

¢ Play Midifiles. 


ALSO AVAILABLE 


Order now: 800-637-1837 


900 Baxter St. 


Charlotte, NC 28204 


704-332-5532 Fax: 704-372-8161 CompuServe:“GO LEADTECH” 





PRO EXPRESS 


Top of the line LEADTOOLS toolkit. 
* Document Imaging. 

¢ Annotation. 

* Xerox TextBridge® OCR. 

¢ Bi-fonal Filters. 


TISCHINOLOGIES 


Boece. Offers, 


LEADTOOLS i is available in several versions, not all features are available in all versions. *License required from Unisys for formats using LZW compression. FlashPix Module and Pro Express require royalties. 
is a Registered Trademark of the Xerox n. LEAD and LEADTOOLS are registered trademarks of LEAD Technologies, Inc. All other product names are trademarks of their respective owners. 
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Visual Studio Pro Upgrade 
by Microsoft Corporation 


“After 100 Rebate. 






VBCommande é 


Practical and Powerful Visu 
Basic 5.0 IDE Productivity Too! 
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nding Programmers Capabilities 
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Active ToolBox Pro 
by Greentree 
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$359 
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MKS Toolkit 
by MKS 






vith your existing environment. 


__ Windows-based installation editing/ 
testing environment. 


Visual SlickEdit 


development productivity, reduces 





Programmer's Paradise’ 


the developer's definitive source for software! 


WISE Installation 
System Enterprise 
Edition 

by WISE Solutions 

The WISE Installation System 
Enterprise Edition incorporates the 
award-winning WISE Installation 
System, and the tools required for 
any type of application deployment. 
The Enterprise Edition includes SmartPatch, 
WebDeploy, and SetupCapture. The WISE 
Installation System creates professional 
installation programs for Windows, Windows 95, 
and Windows NT. WISE is a completely 


INSTALLATION 


SYSTEM Eateprise 
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G10 0230-FT 
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by MicroEdge 


This award-winning editor increases 


costs of software maintenance and 
improves software quality through 
powerful features, software 
standardization and compatibility 


Software Standardization 


‘gh With its multi-platform presence, integration with industry 


The Hottest Develor 





Visual SlickEdit provides your entire 
~ organization with a standard 
“coding environment. 


includes FULL SOURCE CODE! 


Standard. Full Source. No Royalties. 





leading development environments and 
compatibility with version control systems, Paradise No. 


M39 0122 -FT 


Object 
Grid 6.0 

by Stingray 
Software, Inc. 
Object Grid is the 
most full-featured 
and economical 
grid control for Visual C++. The latest 
release supports Excel-compatible 
formulas, IntelliMouse, and any OCX 
in a cell! This builds on the excellent 
support for OLE/DB, DAO, ODBC and 


the grid’s ability to be bound to any 
cell. Written in 100% C++/MFC and 


STINGRAY 


Software 
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c-tree Plus® 
by FairCom 

DOS ¢ WINDOWS e NT @ UNIX e 
OS/2 ¢ SUN e RS6000 ¢ HP9000 
MAC @ ONX @ BANYAN e SCO. 
This well known, highly-portable 
data management package has 
become established as the too! of 
choice for commercial development. 
Offering unprecedented data 
control, choose from direct low 
level access, ISAM level, or 
ODBC access with the FairCom 
Server. Single User, Multi User, 
or optional Client/Server, ANS! 
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other enhancements! 














ProEssentials’ 

by GigaSoft* 

16- and 32-bit DLL, OCX, VCL, 
and VBX interfaces providing 
charting functionality with 
consistent visual quality, 
real-world practicality, and 
overall professional appeal. 
Suited for engineering, 
financial, data-acquisition, 
and information system development. Be sure to find the best 
tool for your important development needs. Download a 
demo [500K] or get a fully functional 
evaluation edition [3Meg] at 





Paradise No. 


‘www.pparadise.com/publishers/gigasoft. © 4 9110-FT 


$369 





VBTools 6 
by BeCubed Software 
Update your 16-bit applications 
today with any of the more than 60 
custom 16-bit VBX controls! You'll 
save bunches of time with the new 
International control because 

it assigns internationalized text to 
all loaded controls! And 
there's a Flow Charting 
control, a 2D Slider, a 
Floating Text Extender 
and more, plus many 


activex 
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VBTools 6 


More than 60 16-bit 
VBX Controls 
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$119 


Ultimate Grid Studio 


Professional Edition 
by Dundas Software 
Here it is! Now you can get 
the Ultimate collection of grid 
components from one reliable 
source. Each component is 
designed to be the best in 
its class. Package includes 
Ultimate MFC, Activex, 
Java, and SDK grids. 
Components also 
available separately. 
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$549 


Spread v2.5 

by FarPoint Technologies 
A complete spreadsheet contro! 
for most environments that 
support a VBX, OLE control, or 
DLL. Use as a spreadsheet to 
obtain variable lines of data or 
display tables of information. 
Data-aware—connect to 
databases with the Access Engine 
and ODBC. Includes over 250 
properties and our improved 





_ Spread Designer. Formulas, sorting, 
and full-print support too. 


Paradise No. 
FO02 0110-FT 


$229 





Order online 24 hours a day, every day at www.pparadise.com 





Visit WWW. ppar a dise «COIN for the full story on these and more great 


products! Use our product search button & order online 24 hours a day, everyday! 


The Fastest & Easiest Way to Create Help Systems 


RoboHELP 5 





by Blue Sky® Software 


The Best Selling Help Authoring Tool 


RoboHELP provides rich support for all Windows Help platforms and 
the emerging HTML-based Help standards. RoboHELP 5 turns 
Microsoft Word 97 and Word 7 into full-featured authoring tools 
capable of creating professional Windows Help, Microsoft HTML 
Help, Netscape NetHelp, printed documentation, and intranet/Internet 
Web sites—all from a single source. *based on independent survey. 





WinHelp Office 5 
: : The Complete Help Authoring Solution 
minhielp Office RoboHELP Office includes RoboHELP”, the Moving-to-HTML Kit, Help RoboHlELP 
Paradise No. Video Kit, Mastering Help, Help Tool Kit, and RoboHTML—at no extra Paradise No. 
B13 0221 -FT cost—a savings of $499 (for a limited time only). If you're serious B13 0121-FT 
$665 about Help authoring, this is the tool for you. $475 
LEADTOOLS Activex RoboHTML 
Pro 16/32 by Blue Sky® Software 
by LEAD Technologies, Inc The Ultimate Way to Z 
LEADTOOLS offers technology for all Author HTML Help 


major imaging categories for color, 
grayscale and bitonal imaging, with 
comprehensive functionality in each 
category. LEADTOOLS is an integrat- 
ed development toolkit with more 
than 500 functions, properties and 
methods. Common Dialog boxes make LEADTOOLS easier to use 
and add-on modules are available for additional FlashPix, OCR, 
and Video support. 
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VForm 
MFC Control 
by TCK Software, Inc. 


Finally, scrolling multi-row 
virtual MFC forms. High- 
performance, object-oriented 
design can utilize ANY 

data source (ODBC, DAO, 
OLEDB, memory arrays,....) 
Totally customizable and 
extendible. Supports multiple 
row types, morphing rows, Tree 
Control emulation, record selectors, 
headers, footers, bitmaps, animation, and 
much more! No Royalties. Full Source 
also available. 
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Codewright Pro 
by Premia 

You'll work faster and easier 
when you use the right 
programmer's editor, 
Codewright. Now you'll 
browse code faster with 
Outline Symbols. It gathers 
information about your code 
in the background. You'll juggle changes 
with ease, with Difference Editing. It lets you z 
selectively combine the changes from two 
revisions. The new Bookmarks Window lets 
you view bookmarks by name and by file. Now 
with synchronizing technology for Delphi and 
Visual C++ IDEs. With the help of the API 
Assistant, making complex function calls is 

as simple as filling in a form. 
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$199 
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drag-and-drop capabilities. 
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RoboHTML is an HTML Help 
Authoring tool that provides a rich 
WYSIWYG editor with full drag and 
drop support, automated project 
management, and complete testing 
features. Unlike HTML editors, 
RoboHTML has built-in support for all 

the HTML Help features such as: Full drag and drop support for 
creating dynamic Table of Contents, multi-level 
Indexes, and Related Topics. Also supports 
easy creation of Popups, Navigation Buttons, 
Splash Screens, Shortcuts, Import existing 
WinHelp projects and more. 
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B13 0310-FT 


$489 


Multi-Edit 
With WebLair 


by American Cybernetics 
Multi-Edit for Windows is a 
powerful, flexible programmer's 
editor. 32-bit native kernel is 
extremely fast with interface 
enhancements to further increase 
productivity. Comprehensive lan- 
guage support, integral FTP 
Support, VCS and Compiler 
Integration make Multi-Edit the 
perfect editor for all programming 
needs. WebLair (a complete web 
development environment) and the 
Borland IDE Integration Package 
are included. 
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$155 


List Pro 

by FarPoint Technologies 

A set of customizable List Box and 
Combo Box controls. Create header 
groups and sub header groups 
and/or multiple columns to create 

a very unique list control. Merge 
cells in rows with same text to 

make display more user friendly. 
Bind to supported databases using 
Microsoft Jet Database Engine or 
Microsoft Access ODBC. Virtual 
memory manager allows connection 
to any size database. Supports single 
or multi-select lists, embedded icons 
and bitmaps in columns and rows, plus 
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Microsoft: 


Where Do You Want To Go Today? 


Co-Sponsored By 





Turn to the Programmer's Paradise 
catalog to find top-quality software 
development products from your 
favorite software publishers. 

Here's a small sampling of what you ll 
find in the Programmer's Paradise 
catalog. For assistance, call us—today! 


A Personal Guarantee! 

“If you're not happy, we're not happy! 
You have my personal guarantee that 
we ll provide you with outstanding 
products, support and service.” 
—Roger Paradis 

President & CEO 
Programmer's Paradise, Inc. 


ORDER NOW 


or call for your free catalog! 


1-800-445-7899 


Call for shipping charges/return policy. 
Prices subject to change without notice. 
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Even Netscape® Navigator® and Microsoft® Internet Explorer share this common thread—a language. 
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Jonathan Erickson 


nnually, Dr. Dobb’s Journal honors individuals who have 
made significant contributions to the advancement of soft- 
ware development. Past recipients of Dr. Dobb’s Excellence 
in Programming Award include: 


e Alexander Stepanov, developer of the C++ Standard Template 
Library. 

e Linus Torvalds, the force behind the Linux operating system. 

e Larry Wall, author of the Perl language. 

e James Gosling, chief architect of Java. 

¢ Ronald Rivest, educator, author, and computer security expert. 

¢ Gary Kildall, a computer pioneer in the areas of operating sys- 
tems, programming languages, and user interfaces. 


Rather than an individual, the recipient of this year’s Excel- 
lence in Programming Award is a team of researchers widely 
known as the “Gang of Four’—Erich Gamma, Richard Helm, 
John Vlissides, and Ralph Johnson. Although they did not in- 
vent design patterns or even write the first book on the sub- 
ject, the GoF’s Design Patterns: Elements of Reusable Object- 
Oriented Software (Addison-Wesley, 1995) can be credited with 
bringing patterns into the mainstream of software development. 
In addition to enjoying widespread use within the software-de- 
velopment community, patterns have become standard fare in 
leading object-oriented design methodologies, notations, and 
environments. Commercial class libraries provide reusable de- 
sign patterns. And research efforts are underway to build tools 
that generate source code from design patterns and even in- 
troduce built-in support for design patterns into programming 
languages. 

Interestingly, the concept of design patterns, which express 
relationships between contexts, recurring problems, and proven 
solutions, have been around for years in other disciplines, most 
notably architecture and urban planning. Much of the work with 
software-design patterns, in fact, has its roots in Christopher 
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Alexander’s book A Pattern Language: Towns, Buildings, Con- 
struction (Oxford University Press, 1977). 

Patterns began moving into the realm of software develop- 
ment in the late 1980s, when Ward Cunningham and Kent 
Beck applied Alexander’s ideas to a small pattern language 
for Smalltalk programmers. This project was presented at OOP- 
SLA ’87 in a paper entitled “Using Pattern Languages for Object- 
Oriented Programs.” Intrigued with the idea of patterns, others 
(James Coplien, Desmond DeSouza, Doug Lea, Richard Gabriel, 
and Grady Booch among them) continued collecting and shar- 
ing idioms and patterns. 

Still, it was at Bruce Anderson’s OOPSLA ’90 Birds-of-a- 
Feather session “Towards an Architecture Handbook” where 
Richard Helm and Erich Gamma met and discovered a com- 
mon vision about reusable object-oriented software. Gamma 
and Helm came together again at ECOOP ’91, where they be- 
gan putting together a catalog of patterns. The need for a defini- 
tive collection of good software design patterns quickly be- 
came apparent, and Gamma and Helm, along with John 
Vlissides and Ralph Johnson, published Design Patterns in 1995, 
which has been described as a book of design patterns that 
describes simple and elegant solutions to specific problems in 
object-oriented software design. 

Coinciding with the rush to the object paradigm, Design Pat- 
terns quickly became a classic, selling more than 100,000 copies 
since publication. Although numerous books on software- 
design patterns have been published since, none have matched 
the stature or acceptance of the GoF’s Design Patterns. 

Erich Gamma started to discover design patterns while work- 
ing and reflecting on the ET++ (a portable C++ class library and 
application framework for developing interactive graphical appli- 
cations) while at the University of Ziirich. This eventually grew 
into his Ph.D. thesis, which described a first set of patterns. After 
university, Gamma moved on to the research lab of Union Bank 
of Switzerland, where he applied objects and frameworks in the 
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From left to right: Ralph Johnson, Richard Helm, 
Erich Gamma, and John Vlissides. 


finance domain. From 1993 to 1995, Gamma worked 
at Taligent on an advanced incremental C++ develop- 
ment environment. Erich is currently technical direc- 
tor at Object Technology International’s Software Tech- 
nology Center in Zurich, Switzerland. 

Richard Helm is a member of IBM Consulting 
Group, IBM Global Services, in Sydney, Australia. 
Before that, he was a technology consultant with 
the DMR Group, a technology consulting firm 
headquartered in Montreal, Canada. Helm, who 
holds a Ph.D. in computer science from the Uni- 
versity of Melbourne, was also a member of the software 
technology department at IBM T.J. Watson Research. In 1995-96, 
Helm and Gamma coauthored the popular “Patterns and Soft- 
ware Design” column for Dr. Dobb’s Sourcebook. 

Although he grew up in the Congo (where his parents were 
missionaries), Ralph Johnson graduated from Knox College in 
Illinois, and received a Ph.D. in computer science from Cornell 
University. Johnson has been at the University of Illinois for 12 
years, where he is a faculty member in the Department of Com- 
puter Science and Coordinator of Project Design Activities. He 
has been involved in the development of an object-oriented op- 
erating system (Choices), compiler (Typed Smalltalk), graphics 
editor framework (HotDraw), and a music synthesis system 
(Kyma). Johnson was program chair of OOPSLA ’93 and con- 
ference chair of the first conference on Pattern Languages of 
Programming (PLoP) in 1994. 

John Vlissides is currently a researcher at IBM’s T.J. Watson 
Research Center in New York, where his research interests are 
in object-oriented design tools and techniques, application 
frameworks and builders, object-oriented visualization, and 
tools for user-interface development. Before joining IBM, Vlis- 
sides was post-doctoral scholar in the Computer Systems Lab- 
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oratory at Stanford Uni- 
versity where he code- 
veloped InterViews, a 
widely used set of li- 
braries and tools for de- 
veloping graphical appli- 
cations. His doctoral work 
was on Unidraw, one of 
the earliest frameworks for 
building graphical editors. 
He also developed numerous 
InterViews- and Unidraw- 
based applications, including 
the idraw drawing editor and 
os the ibuild user-interface builder. 
He has served as a consultant to 
several companies both before 
and after joining IBM. In addition 
to Design Patterns, Vlissides is coauthor of Object-Oriented 
Applications Frameworks and Pattern Languages of Program 
Design 2. He is also consulting editor of Addison-Wesley’s 
Software Patterns Series. Vlissides also writes the “Pattern 
Hatching” column for The C++ Report. Vlissides has a Ph.D. 
in electrical engineering from Stanford University. 

The Gang of Four’s contribution to software development will 
be acknowledged at the Software Development 98 Conference 
in San Francisco and Dr. Dobb’s Journal will make available fi- 
nancial grants to university programs of their choice. 

Please join us in honoring Erich Gamma, Richard Helm, Ralph 
Johnson, and John Vlissides. As with previous recipients of this 
award, they continue to remind us that a mix of technology, in- 
novation, vision, and cooperative spirit continue to be funda- 
mental software-development principles. 


DDJ 
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Measuring software 
quality to 
support testing 





Shari Lawrence Pfleeger 


n an ideal situation, we become so 

good at our craft that every program 

works properly every time it is run. Un- 

fortunately, this is not reality. For one 
thing, many software systems deal with 
large numbers of states and complex for- 
mulae, activities, and algorithms. In addi- 
tion, we have to implement a customer’s 
conception of a system even though the 
customer may be uncertain of exactly what 
is needed. Finally, the size of a project 
and number of people involved can add 
complexity. Thus, the presence of faults 
is not merely due to errors in the software, 
but also depends on user and customer 
expectations. 

What does it mean when we say soft- 
ware has failed? Usually, it means that 
the software does not do what the re- 
quirements describe. For example, the 
specification may state that the system 
must respond to a particular query only 
when users are authorized to see the 
data. If the program responds to unau- 
thorized users, we say it has failed. There 
can be several reasons for failure, in- 
cluding: 


e The specification may be wrong. It may 
not state exactly what the customer 


Shari is president of Systems/Software and 
author of Software Engineering: Theory 
and Practice (Prentice Hall, 1998). Shari 
can be contacted at s.pfleeger@ieee.org. 
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wants or needs. For example, the cus- 
tomer may actually want to have sever- 
al categories of authorization, with each 
category having a different kind of ac- 
cess, but that requirement may not have 
been stated explicitly. 

e The specification may contain a re- 
quirement that is impossible to imple- 
ment, given the prescribed hardware 
and software. 

e The system design may contain a fault. 
Perhaps the database and query lan- 
guage designs make it impossible to au- 
thorize users. 





e The program design may contain a fault. 
The component descriptions may con- 
tain an access-control algorithm that 
does not handle a case correctly. 

e The program code may be wrong. It 
may implement the algorithm improp- 
erly or incompletely. 


No matter how capably you write pro- 
grams, you should ensure that compo- 
nents are coded correctly. Many pro- 
grammers view the object of testing as a 
demonstration of their programs per- 
forming properly. However, demonstrat- 


ing correctness is actually antithetical to 
what testing is about. We test a program 
to demonstrate the existence of a fault. 
Because the goal is to discover faults, we 
consider a test successful only when a 
fault is discovered or a failure occurs. Fault 
identification is the process of determin- 
ing what fault(s) cause a failure, and fault 
correction (or fault removal) is the pro- 
cess of making changes to the system so 
that the fault is removed. 


Orthogonal Defect Classification 
It is useful to categorize and track the types 
of faults you find, not just in code but any- 
where in a software system. Historical in- 
formation can help you predict what types 
of faults your code is likely to have (which 
helps direct testing efforts), and clusters 
of certain types of faults can warn you 
that it may be time to rethink designs or 
requirements. Many organizations perform 
statistical fault modeling and causal analy- 
sis, both of which depend on under- 
standing the number and distribution of 
types of faults. For example, IBM’s Defect 
Prevention Process (described in “Experi- 
ences with Defect Prevention,” R. Mays et 
al., JBM Systems Journal, 29, 1990) seeks 
out and documents the root cause of ev- 
ery problem that occurs. The information, 
which is used to suggest the types of faults 
testers should look for, has reduced the 
number of faults injected in the software. 
In “Orthogonal Defect Classification: A 
Concept for In- process Measurements” 
(EEE Transactions on Software Engineer- 
ing, November 1992), Ram Chillarege et 
al. at IBM developed an approach to fault 
tracking called “orthogonal defect classi- 
fication,” in which faults are placed in cat- 
egories that collectively paint a picture of 
which parts of the development process 
need attention because they are respon- 
sible for spawning many faults. Thus, the 
classification scheme must be product- 
and organization-independent and must 
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(continued from page 22) 

be applicable to all stages of development. 
Table 1 lists the types of faults that make 
up IBM’s classification. When using the 
classification, developers identify not only 
the type of fault, but whether it involves 
something that is missing or is incorrect. 

A classification scheme is orthogonal if 
any item being classified belongs to ex- 
actly one category. In other words, you 
want to track the faults in the system in 
an unambiguous way, so that the sum- 
mary information about the number of 
faults in each class is meaningful. You lose 
the meaning of the measurements if a fault 
belongs to more than one class. Likewise, 
the fault classification must be clear, so 
that any two developers can classify a par- 
ticular fault in the same way. 

Fault classification, such as IBM’s and 
Hewlett-Packard’s (see the accompanying 
text box entitled “Hewlett-Packard’s Fault 
Classification”), improves the entire de- 
velopment process by telling you which 
types of faults are found in which devel- 
opment activities. You can build a profile 
of the types of faults located by each of 
the fault-identification or testing techniques 
you used while building the system. It is 
likely that different methods will yield dif- 
ferent profiles. Then you can build your 
fault prevention and detection strategy 
based on the kinds of faults you expect 
in the system and the activities that will 
root them out. Chillarege et al. illustrate 
IBM’s use of this concept by showing that 
the fault profile for design review is very 
different from the fault profile for code in- 
spection. 


Software Quality 

Software quality can be measured in many 
ways. One way to assess the “goodness” 
of a component is by the number of faults 
it contains. It seems natural to assume that 
the most elusive software faults are also 
the most difficult to correct. It also seems 
reasonable to believe that faults that are 
easy to fix are detected early on, while 
the more difficult faults are located later 
in the testing process. 

However, M.L. Shooman and M. Bolsky 
found this is not the case (see “Types, Dis- 
tribution and Test and Correction Times 
for Programming Errors,” Proceedings of 
the 1975 International Conference on Re- 
liable Software, EEE Computer Society 
Press, 1975). It can take some time to find 
trivial faults, and many such problems are 
overlooked or do not appear until well 
into the testing process. Moreover, Glen- 
ford Myers reports in The Art of Software 
Testing (John Wiley & Sons, 1979) that as 
the number of detected faults increases, 
the probability of the existence of more 
undetected faults increases. If there are 
many faults in a component, you want to 
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find them as early as possible in the test- 
ing process. However, if you find a large 
number of faults at the beginning, then 
you are likely to have a large number of 
yet undetected faults. 

In addition to being contrary to intu- 
ition, these results also make it difficult to 
know when to stop looking for faults dur- 
ing testing. You must estimate the num- 
ber of remaining faults, not only to know 


Fault classification 
improves the entire 
development process 





when to stop searching for more, but also 
to give some degree of confidence in the 
code you are producing. The number of 
faults also indicates the maintenance ef- 
fort that will probably be needed if faults 
are left to be detected after the system is 
delivered. 

Harlan Mills developed a technique 
known as “fault seeding” or “error seed- 
ing” to estimate the number of faults in 
a program (see “On the Statistical Vali- 
dation of Computer Programs,” Techni- 
cal Report FSC-72-6015, IBM Federal Sys- 
tems Division, 1972). His basic premise 
is that one member of the test team in- 
tentionally inserts (“seeds”) a known num- 
ber of faults into a program. Then, other 


team members locate as many faults as 





they can. Assuming that the ratio of seed- 
ed faults detected to total seeded faults 
is the same as the ratio of nonseeded 
faults detected to total nonseeded faults, 
we can estimate the number of indige- 
nous faults remaining in the program. 
Thus, if a program is seeded with 100 
faults and the test team finds only 70, it 
is likely that 30 percent of the indigenous 
faults remain in the code. 

To express this ratio more formally, let 
S be the number of seeded faults placed 
in a program, and let V be the number of 
indigenous (nonseeded) faults. If 7 is the 
actual number of faults detected during 
testing, and s is the number of seeded faults 
detected during testing, then an estimate 
of the total number of indigenous faults is: 


N=Sn/s 


Although simple and useful, this ap- 
proach assumes that the seeded faults are 
of the same kind and complexity as the 
actual faults in the program. But you do 
not know what the typical faults are be- 
fore you’ve found them, so it is difficult 
to make the seeded faults representative 
of the actual ones. One way to increase 
the likelihood of representativeness is to 
base the seeded faults on historical records 
for code from similar past projects. How- 
ever, this approach is useful only when 
you have previously built similar systems. 
And things that seem similar may, in fact, 
be quite different in unanticipated ways. 

To overcome this obstacle, you can use 
two independent test groups to test the 
same program—Test Group 1 and Test 
Group 2, for instance. Let x be the num- 
ber of faults detected by Test Group 1, 
and y the number detected by Test Group 
2. Some faults will be detected by both 
groups; call this number of faults g, so 

(continued on page 28) 





Table 1: IBM orthogonal defect classification. 
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Let’s face it: If you’re looking 
to spot the source of most 
performance bottlenecks, you're 
spending an inordinate amount 
of time operating in the dark. 
Unless, of course, you're using 
Visual Quantify” for your 


performance tuning. By clearly 


displaying performance data in graphical and text 
format, Visual Quantify will open your eyes to a whole 
new way of pinpointing the portions of code that slow 


down execution speed. With Rational’s patented 
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(continued from page 24) 

that g<x and g§y. Finally, let 7 be the to- 
tal number of all faults in the program; 
you want to estimate 7. 

The effectiveness of each group’s test- 
ing can be measured by calculating the 
fraction of faults found by each group (see 
the accompanying text box “Measuring 
Test Effectiveness”). Thus, the effective- 
ness E, of Group 1 can be expressed as: 


E,=x/n 
and the effectiveness £, of Group 2 as: 
E,=y/n 


The group effectiveness measures the 
group’s ability to detect faults from among 
a set of existing faults. Thus, if a group 
can find half of all faults in a program, its 
effectiveness is 0.5. Consider faults detected 


by both Group 1 and Group 2. Assuming 
that Group 1 is equally effective at find- 
ing faults in all parts of the program, you 
can look at the ratio of faults found by 
Group 1 from the set of faults found by 
Group 2. That is, Group 1 found g of the 
y faults that Group 2 found, so Group 1’s 
effectiveness is q/y. In other words: 


E,=x/n=q/y 


However, you know that £, is y/n, so you 
can derive the following formula for 7: 


n=q/(E, +E) 


You have a known value for g, and you 
can use estimates of g/y for E, and g/x for 
E,, so you have enough information to es- 
timate 7. 

To see how this method works, sup- 
pose two groups test a program. Group 


- vat ee of faults o 
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1 finds 25 faults. Group 2 finds 30 faults, 
15 of which are faults also found by Group 
1. So, you have: 


The estimate, £,, of Group 1’s effec- 
tiveness is g/y, or 0.5, since Group 1 found 
15 of the 30 faults found by Group 2. Sim- 
ilarly, the estimate, E,, of Group 2’s ef- 
fectiveness is g/x, or 0.6. Thus, your esti- 
mate of 7 (the total number of faults in 
the program) is 15/(.5*.6), or 50 faults. 

The test strategy defined in the test plan 
can use this estimating technique to de- 
termine when testing is complete. 


Confidence in the Software 

You can use fault estimates to tell how 
much confidence you can place in the 
software being tested. Confidence, usual- 
ly expressed as a percentage, indicates the 
likelihood that the software is free of faults. 
That is, if you say a program is fault free 
with a 95 percent level of confidence, then 
you mean that the probability that the soft- 
ware has no faults is 0.95. 

Suppose you have seeded a program 
with S faults, and claim that the code has 
only N actual faults. You test the program 
until you’ve found all S of the seeded 
faults. If, as before, 7 is the number of ac- 
tual faults discovered during testing, then 
the confidence level can be calculated as: 


1 if n>N 
C=) SAS-N+1) if n<N 


For example, suppose you claim that 
a component is fault free, meaning that 
N is zero. If you seed the code with ten 
faults and find all ten without uncover- 
ing an indigenous fault, then you can cal- 
culate the confidence level with S=10 and 
N=0. Thus, C is 10/11, for a confidence 
level of 91 percent. If the requirements 
or contract mandate a confidence level 
of 98 percent, you would need to seed 
S faults, where S/(S—0+1) = 98/100. Solv- 
ing this equation, you see that you must 
use 49 seeded faults and continue test- 
ing until all 49 faults were found (with 
no indigenous faults discovered). 

This approach presents a problem: You 
cannot predict the level of confidence un- 
til all seeded faults are detected. In his pa- 
per “Computer Software: Testing, Relia- 
bility Models, and Quality Assurance” 
(Technical Report NPS-55RH74071A, Naval 
Postgraduate School, 1974), E.R. Richards 
suggests a modification, where the confi- 
dence level can be estimated using the num- 
ber of detected seeded faults, whether or 
not all have been located. In this case, C is: 


1 if n>N 
O(S-2) * (Nas?) fins 
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These estimates assume that all faults 
have an equal probability of being de- 
tected, which is not likely to be true. How- 
ever, many other estimates take these fac- 
tors into account. Such estimation 
techniques not only give you some idea 
of the confidence you can place in your 
programs, but also provide a side bene- 
fit. Programmers may be tempted to con- 
clude that each fault discovered is the last 
one. If you estimate the number of faults 
remaining, or if you know how many 
faults you must find to satisfy a confidence 
requirement, you have incentive to keep 
testing for one more fault. 

These techniques are also useful in as- 
sessing confidence in components that 
are about to be reused. You can look at 
the fault history of a component, espe- 
cially if fault seeding has taken place, and 
use such techniques to decide how much 
confidence to place in reusing the com- 
ponent without testing it again. Or, you 
can seed the component and use these 
techniques to establish a baseline level of 
confidence. 


Identifying Fault-Prone Code 

Many techniques for identifying fault- 
prone code use fault histories in similar 
applications. For example, some re- 
searchers track the number of faults found 
in each component during development 
and maintenance. They also collect mea- 
surements about each component, such 
as size, number of decisions, number of 
operators and operands, or number of 
modifications. Then, they generate equa- 
tions to suggest the attributes of the most 
fault-prone modules. These equations can 
be used to suggest which of your com- 
ponents should be tested first and which 
should be given extra scrutiny. 

In “Empirically Guided Software De- 
velopment using Metric-based Classifica- 
tion Trees” WEEE Software, March 1990), 
Adam Porter and Richard Selby suggest 
using classification trees to identify fault- 
prone components. Classification-tree 
analysis is a statistical technique that sorts 
through large arrays of measurement in- 
formation, creating a decision tree to show 





Figure 3: Classification tree to 
identify fault-prone modules. 
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Measuring Test Effectiveness 


2 aspect of test planning and re- 
ting is measuring test effective- _ 
s. In Messunng the Effective- 
be calculated by level. In this way, inte- 
- gration testing might be 50 percent ef- 
_ fective at finding critical faults, but 80 
— percent effective at finding minor faults. 
_ Alternatively, test effectiveness may be 


| fouad ie the test). For ea sup- 
pose integration testing finds 56 faults, 


and the total testing process finds 70 


faults. Then Graham’s measure of test 
_ effectiveness says that integration test- 


ing was 80 percent effective. However, 


es the system is delivered after 


0 faults are found, and 70 addi- a 
faults are fea during the - 


st six months of operation. Then in- 
tegration testing is responsible for find- 


ing 56 of 140 faults, or only 40 percent _ 


test effectiveness. 
‘This approach to 2S the im- 


pact of a particular testing phase or tech- 


which measurements are the best predic- 
tors of a particular attribute. For instance, 
suppose you collect measurement data 
about each component built in the orga- 
nization. You include size (in lines of 
code), number of distinct paths through 
the code, number of operators, depth of 
nesting, degree of coupling and cohesion 
(rated on a scale from one as lowest to 
five as highest), time to code the compo- 
nent, number of faults found in the com- 
ponent, and more. You use a classifica- 
tion-tree analysis tool (such as C4, CART, 
or the Trellis graphics capabilities of S- 
Plus; see “Object-Oriented Programming 
in S,” by Richard Calaway, DDJ, October 
1995) to analyze the attributes of the com- 
ponents that had five or more faults, com- 
pared with those that had less than five 
faults. The result is a decision tree like that 
in Figure 3. 

The tree is used to help you decide which 
components in your current system are like- 
ly to have a large number of faults. Ac- 
cording to the tree, if a component has be- 
tween 100 and 300 lines of code and has at 
least 15 decisions, then it may be fault prone. 
Or, if the component has over 300 lines of 
code, has not had a design review, and has 
been changed at least five times, then it, too, 
may be fault prone. You can use this type 
of analysis to help target testing when re- 
sources are limited. Or, you can schedule 
inspections for such components, to help 
catch problems before testing begins. 


nique can be adjusted in “several ways. 
For example, failures can be assigned a 
severity level, and test effectiveness can © 


combined with root cause analysis, so 


that we can describe effectiveness in find- 
ing faults as early as possible in devel- 


opment. For example, integration testing — 
may find 80 percent of faults, but half of 
those faults might have been discovered. 


_ earlier, such as during design review, be- 


cause they are design problems. | 
Test efficiency is computed by divid- — 
ing the number of faults found in test- 


ing by the cost of testing, to yield a val- 
ue in faults per staff hour. Efficiency 


measures help us to understand the cost — 
of finding faults, as well as the relative 
costs of finding them in different phas- 


es of as etotne process. 


—S.L.P. 





Measure for Measure 

The metrics described here represent only 
some of the ways that measurement can 
assist you in testing your software. Mea- 
surement is useful in at least three ways: 


e Understanding your code and its design. 

e Understanding and controlling the test- 
ing process. 

e Predicting likely faults and failures. 


You can measure processes, products, 
and resources to make you a better-edu- 
cated tester. For example, measures of the 
effectiveness of the testing process suggest 
areas where there is room for improvement. 
Measures of code or design structure high- 
light areas of complexity where more un- 
derstanding and testing may be needed. 
And measuring the time it takes to prepare 
for tests and run them helps you under- 
stand how to maximize your effectiveness 
by investing resources where they are most 
needed. You can use the goals of testing 
and development to suggest what to mea- 
sure, and even to define new measurements 
tailored to your particular situation. The ex- 
perience reported in the literature is clear: 
Measurement is essential to doing good test- 
ing. It is up to you to find the best ways 
and best measures to meet your project’s 
objectives. 


DDJ 
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Making a 166-MHz 
Pentium perform like a 
300-MHz Pentium IT 








Robert R. Collins 


ecently, I met with someone who 

has made a career benchmarking 

computer products. In fact, he found 

the business so lucrative he quit his 
day job to capitalize on the benchmark- 
ing business. Over dinner, he said, “Tell 
me what results you want, and I'll choose 
the benchmark to give you those results.” 
This didn’t surprise me, as I’ve been in- 
volved with benchmarking throughout my 
career. However, ordinary consumers 
might be surprised by his comment— es- 
pecially if they've ever bought a comput- 
er product because of published bench- 
mark results. 

Last year, the editors at a popular com- 
puter magazine bemoaned the task of 
benchmarking an AMD K6-based com- 
puter. They complained that the compar- 
ison between the AMD-based PC and an 
Intel Pentium MMX-based PC they previ- 
ously tested couldn’t be fair because the 
AMD-based system had a higher perfor- 
mance disk subsystem. 

At the time of the AMD test, the Sea- 
gate Cheetah Ultra-Wide SCSI hard drive 
was new to the market; it wasn’t even 
available for beta testing when the Intel 
computer had been tested. The Cheetah 


kobert is an independent consultant on 


the x86 architecture. He can be reached 
at rcollins@x86.org. 
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Benchmarks: Fact, 
Fiction, or Fantasy? 


was the state-of-the-art in disk perfor- 
mance. At the time of the Intel test, how- 
ever, state-of-the-art was the Seagate Bar- 
racuda 4LP Ultra-Wide SCSI hard drive. 
Both AMD and Intel were guilty of stack- 
ing the deck in their favor by configuring 
a benchmark computer with the highest 
performance peripherals that were avail- 
able on the market. Therefore, the mag- 
azine’s complaints seemed disingenuous. 

There’s nothing new about vendors load- 
ing a computer with higher-performance 
peripherals for benchmarking purposes. We 
did it when I worked at Acer; Intel did it 


Paste 


for the Pentium MMxX tests. So why would 
the magazine editors complain when AMD 
did the same thing? I suppose if they re- 
ally were concerned with this problem, 
the editors could have put the Cheetah in 
the Intel computer or the Barracuda 4LP 
in the AMD system, and rerun the tests. 
Instead, it was easier to complain about 
the inherent inequity in the test than try 
to do something about it. The end result 
is that the article implied AMD had been 
somewhat deceptive. 











Instances of computer manufacturers 
submitting computers with the highest 
performance peripherals available is com- 
mon in the benchmarking industry. Ven- 
dors know that high-performance pe- 
ripherals can hide a multitude of bad 
designs. Thus, benchmark results don’t 
end up reflecting a good computer de- 
sign— they end up reflecting how well 
the various components work within the 
computer system. 

In general, I’ve found there are three 
basic types of benchmarks: 


e Those that attempt to gauge overall sys- 
tem performance. 

e Those that attempt to measure specific 
performance, such as raw compute pow- 
er or bandwidth of specific subsystems. 

e Those that are only intended for market- 
ing purposes or to mislead and deceive. 


Benchmark suites have been developed 
to serve the needs of business, the most 
popular being BAPCO’s Sysmark (http:// 
www.bapco.com/) and Ziff-Davis’ Win- 
stone (http://www.zdbop.com/). Both Sys- 
mark and Winstone attempt to give you 
a measurement of overall system perfor- 
mance. Raw computing power only plays 
a minor role. Overall throughput is the 
metric of choice. 

Some benchmarks attempt to measure 
specific performance, like microprocessor 
and disk subsystems. WinBench (also at 
http://www.zdbop.com/) and SPEC 
(http://www.spec.org/) are the two most 
popular benchmarks in this category. Win- 
Bench uses a technique called a “playback” 
test, which uses logs of system calls made 
during specific application activities (graph- 
ics calls or disk usage, for example), then 
plays them back in isolation. WinBench 
measures each subsystem of the comput- 
er: disk, CD-ROM, CPU, memory, and 
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graphics. SPEC is intended as a measure- 
ment of raw CPU integer or floating-point 
performance (SpecINT95 and SpecFP95). 

Benchmarks can also be devised for 
marketing purposes, and their published 
results should be carefully weighed. In- 
tel’s ICOMP benchmark (Version 1.0), for 
instance, is an example of a benchmark 
program developed by a microprocessor 
manufacturer for measuring microproces- 
sor performance. ICOMP 1.0 was a pro- 
prietary benchmark, unique to Intel. Intel 
didn’t publish the formula for ICOMP 1.0, 
nor did it allow anyone to license the pro- 
gram. Therefore, the results couldn’t be 
independently verified. Thankfully, Intel 
has abandoned ICOMP 1.0, replacing it 
with ICOMP 2.0. The formula for ICOMP 
2.0 is published at http://www. intel.com/ 
procs/perf/icomp /faxback/ICOMP.HTM, 
making it an open standard that can be 
independently verified. 

In short, vendors know that peripher- 
als make more difference in benchmark 
results than any motherboard design or 
CPU choice. Differences in motherboard 
design would never account for more 
than one or two percentage points of 
benchmark results. Likewise, whether 
you choose an AMD, Cyrix, Intel, or IDT 
586-class CPU will not make much of a 
difference in most benchmark results ei- 
ther. Therefore, unless you have specif- 
ic computing needs, your money would 
be more wisely spent by purchasing a 
slower, cheaper motherboard and CPU, 
and spending the money you’ve saved 





Table 1: Computer systems. 
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on higher-performance peripherals. 





The Challenge 
To prove my point, I will make a Pentium 
running at 166 MHz appear to outperform 
a Pentium IT running at 300 MHz. How 
will I do this? Simple. I'll start with a ba- 
sic 160-MHz computer that is typical of 
home systems. From that point, Ill up- 
grade the graphics and disk subsystems. 
With these two minor changes, I can 
achieve higher (benchmarked) perfor- 
mance than the 300-MHz Pentium II, run- 
ning with the same base components as 
the Pentium 166. 

This demonstration will underscore 
some important principles: 


e Beware of benchmark results. Always 
scrutinize what peripherals are being 
used. If you don’t recognize the pe- 
ripherals, you might as well consider 
the benchmark results meaningless. 
Beware of benchmark comparisons. If 
the motherboard, memory, and periph- 
erals aren’t identical, the benchmark 
comparison is meaningless. Even when 
the peripherals are the same, the results 
can sometimes be meaningless. For ex- 
ample, in one test, the graphics card may 
have been set for 1025x768x(eight-bit) 
colors, while another test may have the 
graphics card set for 1024x768x(24-bit) 
colors. The latter test must write three 
times as much data to the video card. 
Therefore, it will drag down the bench- 
mark results. 

The proper graphics card and disk sub- 
system can substantially increase per- 
formance. You don’t need a 300-MHz 
Pentium II to play Quake (in spite of 


what the Bunny-Geeks tell you). You 
only need to upgrade the components 
of your computer to achieve 300-MHz- 
like performance. 

e Choosing the highest priced components 
doesn’t always produce the highest per- 
formance. 

e It’s easy to cripple the performance of 
a 300-MHz computer— simply choose 
the wrong peripheral components. 


For the purposes of these tests, I’ve se- 
lected benchmark suites from both main 
categories of benchmarking. Winstone and 
Sysmark are representative benchmark 
suites to gauge overall system perfor- 
mance. Winstone is published by Ziff- 
Davis. A consortium of computer and soft- 
ware manufacturers (of which Intel is a 
member) publishes BAPCO’s Sysmark. 
(BAPCO’s office, in fact, is located inside 
Intel’s corporate headquarters.) In gener- 
al, I found the BAPCO’s methodology 
more scientific than Winstone’s. While 
Winstone allows testers to set a dizzying 
array of configuration options, BAPCO 
does not. The infinite variability of Win- 
stone test configurations makes it more 
prone to manipulation for benchmark 
brinkmanship purposes. Therefore, it is 
unfortunate that Intel’s involvement in 
BAPCO has meant the Sysmark suite has 
a low acceptance level in the computer 
industry. 

For low-level benchmarks, I’ve chosen 
WinBench and SpecINT. WinBench at- 
tempts to measure all of the computer sub- 
systems, while SpecINT only measures 
microprocessor integer performance. 

I set up two separate computer systems 
as identically as possible. Table 1 describes 
both systems and my rationale for each 
choice. The computer systems in this table 
are considered the control (constants) of 
my tests. These components will not 
change from configuration to configura- 
tion, or from test to test. It’s important in 
benchmarking to establish a control set. 
Once benchmark results are established 
for the control, the variable components 
can be changed to demonstrate their net 
effect. 

As my variable components, I’ve cho- 
sen two different subsystems. I’m going to 
change the video and disk subsystem in- 
dependently to demonstrate their influence 
on the benchmark results. Tables 2 and 3 
list both subsystems and the components 
I've chosen for my test purposes. 


Problems at Benchmark Central 

Running benchmarks always seems to pre- 
sent its share of problems. I found that 
some benchmarks are temperamental. Win- 
stone 97 can easily time out or even hang 
a computer system. With Winstone 97, I 
discovered that the order in which the 
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(continued from page 32) 
benchmark software and the NT Service 
pack are installed made a difference in func- 
tion and performance. If the Service Pack 
is installed before the benchmark software, 
then the tests often time out. If the Service 
Pack is installed after the benchmark soft- 
ware, then no such problems exist. 
Winstone 97 also gave me problems 
with my IMS Twin Turbo video card. In 
spite of numerous calls to the manufac- 
turer, I could never resolve these prob- 
lems. Eventually I gave up, and told the 
manufacturer that it would have to be 
dropped from the test. It’s too bad, be- 
cause the IMS Twin Turbo was a perfect 
example of “higher cost isn’t always high- 
er performance.” The Twin Turbo cost 
nearly double the price of the Matrox Mil- 


Table 2: Video subsystem. 
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lennium, yet suffered from 30-percent low- 
er performance. This just shows that bet- 
ter peripherals don’t always come in more 
expensive packages. 

The Pioneer 24x CD-ROM had its share 
of problems. When running WinBench, 
the CD-ROM tests sometimes failed. Win- 
Bench reported that the CD-ROM drive 
didn’t contain a disk, and suggested that 
I reinsert the disk before retrying the test. 
I found that this problem was likely caused 
by the high-speed drive being out of bal- 
ance. Often times, when I repositioned 
the CD-ROM drive, the problems would 
go away. I also noticed that the perfor- 
mance of the CD-ROM drive was incon- 
sistent. Again, this was most likely caused 
by the drive being out of balance and is- 
suing multiple retries before successfully 








reading the data on the CD-ROM. 

Lastly, my Micropolis 4345 WAV drive 
blew a head gasket before crossing the fin- 
ish line. I was running my last configura- 
tion on this drive when it quit working. 
The motor would spin up, then spin 
down— over and over. The hard drive nev- 
er booted again. Contacting the manufac- 
turer for a replacement wouldn’t have 
helped: Micropolis is now out of business. 


The Results 

As I expected, benchmarks that measured 
specific computer components were the 
most reliable and gave the most consis- 
tent results. Such benchmarks weren’t 
dramatically affected by high-performance 
video controllers and high-performance 
disk subsystems. SpecINT95 gave consis- 
tent results regardless of configuration 
changes. The Pentium 166 consistently 
scored between 3.53 and 3.55 SpecINT95 
(base). The Pentium II 300 scored con- 
sistently at 8.99 to 9.00 SpecINT95. Be- 
cause of their predicted consistency, I have 
excluded their results from the remainder 
of this discussion— doing so would only 
become redundant. 

The remainder of the results will show 
the dramatic impact that enhanced pe- 
ripherals can have on overall system per- 
formance. Keep in mind that the ultimate 
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goal is to prove that a Pentium 166 can 
show higher performance on systems 
benchmarks than a Pentium II 300. Win- 
stone and Sysmark are our systems- 
benchmark measuring tools. Even though 
WinBench isn’t a systems benchmark, I’ve 
included selected WinBench results to 
demonstrate the impact on the computer 
system of enhancing the video and disk 
subsystems. 

Figure 1 shows the baseline configura- 
tion for my Pentium 166 and Pentium II 
300 tests. The baseline configuration for 
each computer consists of an IDE hard 
drive, and standard VGA video controller. 
All of the results have been normalized to 
this baseline configuration. Hence, all re- 
sults can be considered as percentage dif- 
ferences between configurations. Nor- 
malizing the results gives an easy-to-read 
visual display, which shows how one 
configuration compares to another. (If 
you're interested in the actual benchmark 
results, they’re available electronically in 
an Excel spreadsheet; see “Resource Cen- 
ter,” page 3. The spreadsheet is also avail- 
able at http://www.x86.org/ddj/Mar98/ 
Benchmarks.xls.) 

Some of the results may appear to be 
anomalous. The Pentium II Graphics Win- 
mark and Business Winstone results show 
lower performance than the baseline con- 
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figuration. These results would tend to 
defy logic. As a means to mitigate these 
anomalous results, I ran each benchmark 
test three times, and reported the highest 
results for any given run. This method 
guarantees that the results aren’t anoma- 


Table 3: Disk subsystem. 
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lous, but instead are best-case represen- 
tations of what was measured by the 
benchmark programs. 

As Figure 1 shows, the Pentium II 300 
performs consistently better than the 
Pentium 166. To show the performance 
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Pentium IT 300. 


benefits of the disk subsystem enhance- 
ments, I’ve broken up the Pentium 166 
and Pentium II 300 results into two sep- 
arate graphs. Both graphs include the Pen- 
tium 166 and Pentium II 300 baseline con- 


Brian Butler 


enchmarking is typically thought of 
B as running a set of standard tests on 

a system to compare its performance 
with that of others. Several benchmarks 
have been proposed to measure hard- 
ware and basic software speed: 


e Performance of a complete system. 

e Performance of a specific subsystem 
(disk, video, CPU, memory, and so on). 

e Performance of a particular application 
(spreadsheet, database, CAD/CAM, and 
the like). 

e Performance of server-based applica- 
tions (such as database, file, mail, web). 

e Capacity of a system (often referred 
to as capacity planning). 


No matter the kind of benchmark, all 
good benchmarks use a well-defined 
testing methodology based on real-world 
use of a computer system. In addition, 
they measure performance in a deter- 
ministic and reproducible manner, al- 
lowing the system administrator to prop- 
erly judge the performance and capacity 
of their systems. When used appropri- 


Brian, president of Client/Server Solutions, 
can be reached at brian_butler@ csrad.com. 
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Figure 1: Baseline comparison between Pentium 166 and 





benefits. 


figuration. This inclusion was intended 
for comparative purposes to demonstrate 
the enhancing effects of higher-perfor- 
mance peripherals. Figure 2 shows the 
benefits of enhancing the disk subsystem 


Benchmark Basics 


ately, benchmarks can provide a means 
of determining tuning parameters, reli- 
ability, botthenecks, and system capaci- 
ty— not to mention providing users with 
impartial buying information. 

Benchmarks are made up of three ba- 
sic components: a specification, control 
logic, and implementation. A benchmark 
specification is typically a document that 
describes the details, design goals, data 
points, and execution plan. A specifica- 
tion can become lengthy, depending on 
the complexity of the benchmark (the 
TPC-D specification, for example, is 
about 170 pages long). 

The control logic should have a well- 
defined initialization sequence that is sep- 
arate from the test and data collection. 
The control logic is not really part of the 
benchmark itself, but the benchmark 
does depended on it for repeatability and 
accurate statistics. Repeatability means 
that benchmark results from multiple runs 
of the same test with the same configu- 
ration test environment and parameters 
are consistent with each other. If a bench- 
mark is not sufficiently repeatable then 
the results have no meaning. 

Implementation of a benchmark spec- 
ification can vary, depending on the sys- 
tem under test. The benchmark specifi- 
cation should always have audit trails in 





Figure 2: Pentium 166 disk enhancement 


for our Pentium 166. As this graph 
shows, the disk subsystem enhancement 
influences overall system performance 
by approximately 12 percent for Sys- 
mark, and approximately 20 percent for 


place to verify the implementation is 
done in the spirit of the benchmark. 
To give you an idea of what’s involved 
in writing a complex benchmark, I’m mak- 
ing available electronically (see “Resource 
Center,” page 3) the source code for a 
benchmark that consists of a single 
database table and three transactions. This 
benchmark demonstrates how to create 
and load the table, and how a transaction 
can be coded using ODBC. To build the 
benchmark, I used the Benchmark Fac- 
tory, a tool my company (Client/Server 
Solutions, http://www.csrad.com/) de- 
veloped. Benchmark Factory includes a 
MFC-like C++ framework called “Bench- 
mark Foundation Classes,” for writing cus- 
tom benchmarks. Benchmark Factory in- 
cludes a DLL Wizard used in conjunction 
with Microsoft’s Visual C++ 5.0 which cre- 
ates the template source files, and gives 
you a head start in developing the bench- 
mark definition. The source code for the 
example benchmark is compiled into a 
Windows DLL, then registered with Bench- 
mark Factory. By coding your benchmark 
is this manner, Benchmark Factory takes 
care of all your timing and scaling, and 


provides the tools required to perform re- 


porting and drill-down data analysis. 
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Winstone. Interestingly, Sysmark and 
Winstone didn’t show much difference 
between the 7200- and 10000-RPM SCSI 
drives, while the WinBench results clear- 
ly showed the performance superiority of 
the 10000-RPM SCSI drives. 

Similar results were seen on the Pentium 
II 300 disk enhancements; see Figure 3. 
The Pentium II 300 results show little ef- 
fect of the 7200-RPM SCSI drives on Win- 
stone, but dramatic performance benefits 
for the 10000-RPM drives. Sysmark seemed 
to show a more linear performance bene- 
fit between the 7200- and 10000-RPM SCSI 
drives. The WinBench results again show 
the performance superiority of the 10000- 
RPM SCSI drives. It’s interesting that the 
Pentium II 300 WinBench results clearly 
show the IBM 9XL having a performance 
advantage over the Seagate Cheetah 9LP. 

As remarkable as the disk- performance 
benefits have been on overall system per- 
formance, I still haven’t achieved the goal 
of making a Pentium 166 benchmark bet- 
ter than a Pentium II 300. To achieve this 
goal, you need only look as far as the 
video performance. Video performance 
alone was enough to fool Business Win- 
stone and Sysmark. But video perfor- 
mance alone wasn’t enough to fool the 
Highend Winstone benchmark programs. 
Figure 4 shows the results for the Pentium 
166 and Pentium II 300. Using the Matrox 
Millennium, Business Winstone showed a 
whopping 81 percent performance im- 
provement over the baseline Pentium II 
300 configuration. Sysmark was only 
fooled into showing a 12 percent differ- 
ence. Don’t ignore the WinBench graph- 
ics results. These results clearly show the 
advantages of high-performance video 
controllers — like the Matrox Millennium. 

Lastly, Figure 5 shows the net effect of 
enhancing the video subsystem with the 
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Figure 3: Pentium II 300 disk enhancement benefits. 


Matrox Millennium, 
and the disk subsys- 
tem with the 10000- 
RPM IBM 9XL SCSI 
hard drive. Like the 
video enhancements, 
Sysmark and Busi- 
ness Winstone were 
fooled into thinking 
the slower Pentium 
166 gave better per- 
formance than the 
stellar Pentium II 300 
did. Figure 5 shows 
the Pentium 166 out- 
performing the Pen- 
tium II 300 by 22 
percent on Sysmark, 
and by 98 percent on 
Business Winstone. 
Highend Winstone 
was not fooled by 
any of my tricks. The Highend Winstone 
results always showed proportional per- 
formance increases that you would expect 
to find between a 166-MHz and 300-MHz 
computer. 


Conclusion 

As I expected, it is easy to make a Pentium 
166 appear to outperform a Pentium II 300. 
With a simple video controller and a hard 
drive, I was able to make Business Win- 
stone 97 show a Pentium 166 to have 98 
percent better performance than a 300-MHz 
Pentium II computer. This demonstrates 
that you shouldn’t always trust the bench- 
mark results that you read in computer 
magazines. Keep this in mind when read- 
ing benchmark comparisons: Unless all of 
the computers are tested on equal config- 
urations, allowing only one variable to 
change (like the motherboard or CPU), then 
the results might as well be meaningless. 
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Figure 5: Video and disk enhancement benefits. 


As I discovered, more isn’t always bet- 
ter. The higher-priced video controllers ac- 
tually performed 30 percent lower than the 
lower-priced Matrox Millennium. Con- 
versely, the $200 Matrox Millennium gave 
a bigger performance boost than the 
$1000+ IBM 9XL SCSI hard drive. A small 
battle between SCSI controllers found the 
Diamond Fireport outperformed the 
Adaptec 2940 UW at approximately one- 
half the price. Even though the 24x CD- 
ROM gave good performance, I found it 
too unstable to be trusted. Therefore, in- 
stead of upgrading my next computer to a 
Pentium II, I think Ill upgrade to a 233- 
MHz AMD K6, add the Matrox Millennium, 
and get the cheaper Diamond Fireport with 
4.5-GB 10000-RPM SCSI hard drive. These 
enhancements will give me all the perfor- 
mance I need— and then some. 
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Tracing BSD 








Developing a “trussing” 
relationship with your 
operating system 





Sean Eric Fagan 


ne of the more useful utilities avail- 

able on some UNIX systems is 

“truss,” also known as “strace” or 

“trace.” The truss utility monitors the 
system calls made by a program. Unlike 
a debugger, truss does not need to have 
symbols compiled into the executable. 
Consequently, it can work with any exe- 
cutable, without recompilation, making it 
invaluable in many debugging situations. 
And since truss can monitor processes that 
are already running, it’s helpful for de- 
bugging system programs and daemons 
such as init and inetd. 

Like all 4.4BSD-derived systems, Free- 
BSD has a ktrace program that monitors 
system calls. It does this with a kernel fa- 
cility that dumps information about each 
system call made by a process. ktrace 
stores this information in a file (which 
can grow very large); the kdump utility 
presents the information in a human- 
readable fashion. 

truss uses a different kernel facility that 
stops the process whenever it enters or 
leaves the kernel. After truss gets the de- 
sired information, it restarts the process 
and the cycle continues. Unfortunately, 
this facility isn’t in the stock FreeBSD ker- 
nel, so I added it. In this article, I’ll de- 
scribe how to add this facility to the 


Sean, a BSD developer for many years, 
can be contacted at sef@kithrup.com. 
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FreeBSD 2.2 kernel (the 3.0 kernel al- 
ready has it). 

FreeBSD is a free, UNIX-like operat- 
ing system, based on the 4.4BSD-Lite dis- 
tribution from the University of Califor- 
nia at Berkeley Computer Science 
Research Group (CSRG). 4.4BSD-Lite is 
the successor to Net/2, which formed the 
basis for 386BSD (see the DD] series 
“Porting UNIX to the 386,” by William 





and Lynne Jolitz, January—November 1991 
and February—July 1992). Information 
about FreeBSD, including how to obtain 
it, is available at http://www.freebsd.org/. 
It is also available on CD-ROM from Wal- 
nut Creek CD-ROM (http://www.cdrom 
.com/). 


The stopevent() Kernel Function 

To simplify the kernel changes, I created 
a single function, stopeveni(), that could 
be easily invoked from many places. My 
original version of stopevent() used a vari- 
ation on the standard SIGSTOP signal, one 
of the “job control” signals added to 4.2BSD. 
It’s related to the SIGTSTP signal, which is 
usually sent in response to Control-Z on the 


System Calls 


terminal. Unlike SIGTSTP, SIGSTOP can- 
not be caught or ignored. 

There are problems with this approach. 
When a process receives SIGSTOP, its par- 
ent is notified— this is how the shell 
knows to continue when you suspend a 
program with Control-Z. This doesn’t work 
well for truss. Besides being rude (the par- 
ent may not be the one that requested the 
stop), it also makes it complicated for truss 
to monitor signal delivery. Proper imple- 
mentation also requires some intimate 
knowledge of process switching. 

The alternative is to use the sleep() or 
tsleep() facility. sleep() makes a process 
wait for an event or resource (the oppo- 
site is wakeup()). When a process invokes 
sleep(), that process stops, no signal is 
generated, and no processes are notified. 
This is the approach used in Listing One 
(listings begin on page 105). 

My stopevent() function uses five fields 
(see Table 1), which I added to Free- 
BSD’s process structure (struct proc, in 
<sys/proc.h>). These fields aren’t always 
sufficient. For example, in the case of a sig- 
nal delivery, stopevent() passes back the 
signal number in the p_xstat field in the 
proc structure. This field is also used by 
exit() and wait(). If this multiple usage 
causes future conflicts, another entry may 
have to be added to the proc structure. 

As the code indicates, p_step is set when 
stopevent() is invoked, and checked ev- 
ery time the process wakes up. The 
tsleep() in the loop controls it. 


tsleep() and wakeup/) 

The first argument to tsleep() is the “chan- 
nel,” an arbitrary value used to identify 
the sleeping process for a future call to 
wakeup(). The second parameter indi- 
cates the priority of the process when it 
does awake, and whether a signal can in- 
terrupt the sleep. The third parameter is 
a string used by ps and the kernel’s status 
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Table 1; stopevent() fields. 


(continued from page 38) 

mechanism to communicate to a user 
where the process is. Finally, the fourth 
parameter is the duration of the sleep; a 
duration of zero means the process will 
sleep forever. 

For stopevent(), this means that the pro- 
cess is put to sleep at PWAIT priority and 
cannot be interrupted. The channel it is 
sleeping on is the address of p->p_step, 
one of the fields added to the proc struc- 
ture. Since each process has its own proc 
structure, this ensures that the address of 
p->p_step is unique — meaning that only 
this process will be awakened when 
needed. 

wakeup() is called with the channel 
code that was passed to sleep(). It goes 
through the list of sleeping processes, 
and schedules all processes that have a 
matching value. (The corresponding 
wakeup_one() wakes only the first 
matching process.) stopevent() calls 
wakeup(&p->p_stype) before tsleep(). 
This wakes the truss program (or any 
other program using this kernel facility) 
so it can handle the event. 

There is no attempt to restrict moni- 
toring to a single process— multiple pro- 
cesses can, in fact, be monitoring the 
same target process. (For example, one 
process might be interested only in core 
dumps or system calls; another may want 
to watch the signal deliveries.) As a re- 
sult, all processes that are sleeping on 
the &p->p_stype of a particular process 
will be awakened by this wakeup() call; it 
is up to each monitoring process to check 
only for the events it is interested in. 

However, because &p->p_stype is 
unique for each process, the wakeup() 
only affects those processes that are in- 
terested in this particular process. 


Invoking stopevent() 
The STOPEVENT() macro is a simple 
wrapper that invokes stopevent() if the 
corresponding event is being checked by 
the process. This macro allows a single 
line to be dropped into nearly any part of 
the kernel, with no other code changes. 
The current implementation uses a bit- 
mask that could support 32 different 
events. Six were enough for my needs: 
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S_SIG for a signal delivery; S_EXIT for a 
process exit; S_EXEC for the exec() of a 
new program; S_SCE for a system-call en- 
try; S_SCX for a system-call exit; and 
S_CORE for a coredump. 

STOPEVENTQ() is placed with the ap- 
propriate arguments at a convenient place 
where a process should stop. I added 
STOPEVENT(p, S_SCE, callp->sy_narg); to 
syscall() (the kernel side of the system 
call interface, in /sys/i386/i386/trap.c). This 
has the process stop, just after system call 
entry, if that event is being monitored. 
callp->sy_narg contains the number of 
32-bit word arguments for the system call. 

The file diff.out (available electroni- 
cally, see “Resource Center,” page 3) 
shows the differences between the orig- 
inal FreeBSD 2.2 kernel and the one 
with my changes. In particular, you can 
see where I have placed STOPEVENT() in 
the kernel; in almost all cases, there was 
no extra work to be done. The one ex- 
ception is kern_sig.c; in two places, it is 
necessary to check if signals are being 
monitored at all. The psignal() function 
normally skips the signal delivery if the 
signal is ignored or blocked. However, it 
always handles the signal if the process 
is being debugged, so the debugger can 
follow the attempted signals. I added a 
check for p->p_stop&S_SIG so truss can 
also follow attempted signals. A similar 
change had to be made to issig(), which 
actually receives the signal and delivers 
it to the process. 


Inside Signal Delivery 

Because of the design of the UNIX kernel, 
delivering a signal requires two differ- 
ent kernel functions. UNIX is a pre- 
emptive, multitasking system, thus, user 
processes. that run for too long are in- 
terrupted, then the kernel runs. Howev- 
er, the kernel itself is never preempted. 
When it is ready to change to a new pro- 
cess context, it does so explicitly by call- 
ing mi_switch(), which is invoked from 
issig(), tsleep(), and just before return- 
ing from a system call. 

The kernel is almost always running in 
the context of a process. Since one pro- 
cess cannot arbitrarily affect another, the 
kernel cannot do so on behalf of a pro- 


cess, either. This is why signal delivery, 
for example, requires two functions. The 
psignal() function runs in the sender’s 
context; issig() runs in the receiver’s con- 
text. A similar issue arises with truss, 
which also interfaces two different pro- 
cesses: one being traced and one doing 
the tracing. 


Interfacing with procfs 

The procfs filesystem (usually available as 
/proc) was added to 4.4BSD to simplify 
access to the kernel process list. Within 
procfs, each process is represented by a 
directory, and files within a directory pro- 
vide access to particular information about 
that process; see Figure 1. For example, 
“Is /proc” will list all running processes, 
while “ls -1 /proc” will also tell you the 
owner of each process. 

There are some tricks involved in work- 
ing with procfs. The procfs_exit() kernel 
function forcibly closes any open procfs 
files that refer to the specified function. 
This must be called when a process exits, 
since the corresponding procfs files will no 
longer exist. When I added a call to 
STOPEVENT() to kern_exit.c, I placed it 
before the call to procfs_exit() so that a 
monitoring process can have one last look. 

The procfs file system is also used by 
my procctl and truss programs to access 
and control the new tracing facility. 

There are several possible ways to in- 
terface with procfs; the easiest is to use 
read() and write(). For example, the procfs 
ctl and status files use this type of inter- 
face: The command “echo hup > /proc/ 
curproc/ctl” sends a SIGHUP signal to the 
shell. Similarly, “cat /proc/curproc/status” 
yields something like cat 1852 1849 114 
18555 5,1 ctty 880144638, 200853 0,0 
0,4255 nochan 100 100 20,20, 20,0,8, 10, 
11,31, 1000, 2000. 

However, when trying to control a pro- 
cess (which involves sending a command, 
then finding out the status), this proves to 
be pretty ungainly. To stop a process, you 
would first send a stop command to the 
ctl file, then repeatedly read the status file 
until the process stopped. (Plan 9 uses a 
variant of this.) 

I chose to communicate through ioc#l(). 
Normally, ioctl) cannot be called with a 
regular file, but that can be solved by re- 
moving the preemptive error return from 
un_ioctl( )— the per-vnode IOCTL routine 
will still return the correct error (ENOTTY) 
for any file type that doesn’t support it, 
but the correct IOCTL routine will be 
called for those that do. 

The rest of the procfs changes are to 
add ioctl options. First, they must be de- 
fined. To do that, I added a new kernel 
file <sys/pioctl.h> (for “process ioctl”), 
which defines six new ioctl options: 
PIOCBIS sets event flag(s); PIOCBIC 
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Figure 1: The /proc file system. 


(continued from page 40) 

clears event flag(s); PIOCSFL sets non- 
event flags; PLOCSTATUS gets process 
status information; PIOCWAIT waits for a 
process to stop; and PIOCCONT contin- 
ues a process. 

The _IOW and _IOR macros in Listing 
Two construct a 32-bit value that is used 
by the IOCTL system call to determine 
what to do with the third argument to 
IOCTL—_IOW means that it is copied 
from user space to kernel space, and _IOR 
is the other way around. 

Listing Three shows procfs_ioctl, which 
implements these new ioct/ options. As 
you can see, PIOCBIS, PIOCBIC, and 
PIOCSFL are fairly straightforward. 

Both PIOCSTATUS and PIOCWAIT fill 
out a procfs_status structure, provided 
by users. The procfs_status structure in- 
dicates whether the process is stopped 
on an event or not (and, if so, which 
event it has stopped on), which events 
are being monitored, any extra data set 
by STOPEVENT(), and the currently un- 
used flags. 

The difference between the two is that 
PIOCWAIT will put the requesting pro- 
cess to sleep until the target process stops. 
This is, of course, where the wakeup() in 
stopevent(), mentioned previously, comes 
into play. First, PLOCWAIT checks to see 
if the process is already stopped. If so, it 
has nothing to do. If the process is not 
stopped, however, PIOCWAIT then calls 
tsleep() on the p_stype. When stopevent() 
goes to stop the process, it first schedules 
any sleeping processes to be woken up. 
Unlike the tsleep() in stopevent(), PCATCH 
is set, meaning that the waiting process 
can be interrupted by signals. 

The last currently supported IOCTL op- 
tion is PLOCCONT. This restarts a process 
that was stopped, and is modeled after the 
ptrace PT_CONTINUE method. This allows 
a stopped process to continue with a spec- 
ified signal, if desired. (PT_CONTINUE also 
allows an address to be specified; howev- 
er, the PC can be set via procfs, by open- 
ing up /proc/process-id/regs and modi- 


fying it.) 
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PIOCCONT restarts the process by first 
clearing the p_step flag in the target pro- 
cess; this tells stopevent() (which is run- 
ning in the context of the stopped pro- 
cess) that it is okay to run now. It then 
wakes up the process, and is done. 


The procctl Program 

The first program to use the interface is 
procctl. This is an administrative program, 
which “unsticks” processes. Remember 
that a process stopped via stopevent() will 
not respond to signals. 

As programs go, procctl is simple but 
limited— the user must specify process 
IDs and there are no options. For each 
process, it simply clears the event bit mask, 
and continues the process. It warns about 
failures, but continues. 

Although simple, it is functional and 
does demonstrate the interface: PIOCBIC 
clears specified bits (~O indicates that it 
should clear all of the bits in the event 
mask), and PIOCCONT wakes the pro- 
cess (and returns EINVAL if the process 
was not already stopped). 

The program was also absolutely nec- 
essary during development— a process 
that was stopped in stopevent() could not 
be killed, which made it very difficult to 
regain control of the system. 

Two obvious improvements to procctl 
present themselves: “unsticking” all pro- 
cesses owned by a specified user, and hav- 
ing the program, itself, search the /proc 
filesystem, rather than having the user 
specify the process IDs. 


The truss Program 
The next application, truss, is of more gen- 
eral use. At its simplest, truss is invoked as 
truss program args. This will trace the sys- 
tem calls made by a program. Example 1 
shows typical truss output. However, it may 
be desirable to have the output of truss 
stored in a file somewhere; the -o option 
will do that. Also, you may wish to ignore 
certain events; the -S option will ignore de- 
livered signals. (This could be extended to 
other options in a straightforward manner.) 

Lastly, truss can be used to trace an al- 
ready-running process, with the -p op- 
tion. This can be used, for example, to 
help debug a daemon process that can- 
not be easily or conveniently restarted. 

After going through the options, if a pro- 
cess ID was not specified, truss needs to 
set up the process itself. It does this in set- 
up_and_wait(), whose sole purpose is to 
create a process, execute the desired pro- 
gram, and leave that process in a stopped 
state. Note that it sets the event mask to 
S_EXEC | S_EXIT initially, and applies that 
to itself after the vfork() and before the 
execup( ). 

In the event that the specified program 
is unable to be executed, then the pro- 
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Example 1: Sample output of truss. 


cess will exit, and stop; if, on the other 
hand, it succeeds, then the process will 
stop before returning from executing in- 
side the kernel. (In both cases, by “stop,” 
I mean that the process will wait in 
stopevent() to awaken.) 

In start_tracing(), truss opens up the 
mem file for the given process (for ex- 
ample, /proc/45/mem), and sets up the 
event mask for the target process. (The 
event mask may or may not set S_SIG, de- 
pending on the options given to truss.) 

Next, truss determines the emulation 
type of the target process. FreeBSD sup- 
ports emulation of different operating sys- 
tems (such as Linux and SCO). In many 
cases, the system call numbers are differ- 
ent for the different operating systems, so 
truss needs to know which mapping to 
use. It determines this by reading the 
procfs etype file; by default, this is 
“FreeBSD a.out”, although “Linux ELF” 
and “IBCS2 COFF” are also common pos- 
sibilities. For the time being, truss supports 
native FreeBSD programs and Linux ELF 
binaries, although I will concentrate only 
on the native (i386 _syscall) versions in 
this discussion. 

Next, truss enters the main loop: It waits 
for the process to stop (and finds out why 
it stopped); prints the desired information; 
and restarts the process. It stops when the 
process has died (pfs.why == S_EXIT). 

When a system call is entered or exit- 
ed, truss calls two processor-specific and 
OS-specific functions. For FreeBSD/a.out 
executables (that is, native binaries), the 
two functions are i386_syscall_entry() 
and 1386_syscall_exit(). The Linux equiv- 
alents get the system call number and ar- 
guments through different means. 

Under FreeBSD on the x86 architecture, 
the system call number is contained in the 








EAX register; success or failure is indicat- 
ed by setting the carry bit in the process 
status register, and the return value is in 
EAX (and, sometimes, EDX as well— 
some system calls, such as fork(), return 
two values). 

The first thing 1386_syscall_entry() and 
i386_syscall_exit() do is to determine 
whether they need to reopen the register 
file. After the first call, they should not 
need to. Then, they seek to the beginning 
of the register file and read the entire reg- 
ister set. 

The system-call names are generated 
automatically in syscalls.c. This file con- 
tains an array of all of the system-call 
names available on the system and is gen- 
erated automatically from the system-call 
configuration file, which is part of the ker- 
nel. It fills the need to translate system- 
call numbers to system-call names, but 
fails to address the need to know the types 
of the arguments. However, the number 
of arguments is listed in the nargs pa- 
rameter to 1386_syscall_entry(). You can, 
therefore, get the arguments by looking 
in the process’ memory space — the ar- 
guments are at ESP + sizeofCint). 

1386_syscall_exit(’s job is easier— it 
determines if the system call failed or suc- 
ceeded (by checking the carry bit of the 
PSW), and prints out the value of EAX ac- 
cordingly (either as an error or as a return 
value). This does not check for system 
calls that return multiple values. 

If you are interested in further experi- 
menting with truss code, additional listings 
are available at http://www.freebsd.org/ 
~sef/truss/index. html. 


DDJ 
(Listings begin on page 105.) 
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A Real-lime 
Performance 





Visualizer for Java 





Finding out 
where to squeeze for 
performance 





John J. Barton and John Whaley 


veryone who uses Java is interested in 

its performance. Why? Because Java 

programs running on Sun’s original in- 

terpreter-driven Java virtual machine 
(JVM) were several orders of magnitude 
slower than comparable programs written 
in C/C++. Bottlenecks included synchro- 
nization, null reference, array-bounds check- 
ing, and Java’s exception handling. Just-in- 
time compilers JITs) on more advanced 
JVMs have begun to close the gap, but fur- 
ther performance gains are possible. 

To investigate the issues surrounding 
JVM performance, Derek Lieber (a mem- 
ber of our group at IBM’s TJ. Watson Lab- 
oratory) wrote an experimental JVM. To 
improve its run-time performance, he 
needed a way to discover its performance 
bottlenecks. A traditional profiler would 
be of limited use here, because it would 
simply profile the JVM without regard to 
the Java program running on it. Also, tra- 


John Barton is a researcher at IBM’s TJ. 
Watson Lab. He can be contacted at 
fjb@watson.ibm.com. John Whaley is a stu- 
dent at MIT. He can be reached at jwhaley@ 
mit.edu. 
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ditional profilers cannot make sense of 
JIT-compiled code, or make a clear dis- 
tinction between time spent executing 
Java code and time spent in the JVM. 














































































































































































































Therefore, we set out to write a perfor- 
mance visualizer for our JVM. We'll de- 
scribe that visualizer in this article. The 
techniques we'll discuss are general 
enough to apply to other virtual machines, 
and the graphics client applet for our vi- 
sualizer is available electronically; see “Re- 
source Center,” page 3. 

Our basic design goals involved ease 
of use, scalability, and potential reuse of 
the profile data to dynamically target op- 
timizations. For routine use, the visual- 
izer should have minimal impact on tar- 
get program performance. It should be 
simple so that it can be applied to arbi- 
trary Java programs, and it should ob- 
tain the performance results dynamical- 





ly, as the target program runs. We want- 
ed the visualizer to work on long-running 
programs where performance is a major 
issue, so that ruled out schemes involv- 
ing large trace files. Moreover, we only 
need information on the performance bot- 
tlenecks: Any seldom-run code need nev- 
er be considered by the visualizer. Final- 
ly, we wanted to use the performance 
information to drive a dynamic JIT opti- 
mizer for our JVM, reinforcing the need 
for simplicity, efficiency, and scalability. 

With these goals in mind, we built a 
program-counter sampling engine into 
the JVM. The engine is driven by a high- 
frequency Windows multimedia (WINMM) 
interrupt. Each time the WINMM interrupt 
fires, the handler stores the VM’s current 
program counter into a ring buffer. 

We chose a program-counter sampling 
approach rather than a timed function en- 
try/exit approach because sampling the 
program counter is much less intrusive. 
Adding stubs at entry/exit points may seem 
innocuous, but can substantially change 
the performance of the profiled program 
due to changes in processor cache hits, 
branch prediction tables, and the like. Also, 
no changes are necessary to the JVM or to 
the JIT-compiled code, simplifying things 
greatly. Another benefit is that the sampling 
overhead is roughly constant, and can eas- 
ily be adjusted by changing the frequency 
of the interrupt. Finally, this approach gives 
us a finer granularity so that we can pro- 
file at the level of a single bytecode, rather 
than at the level of a method. 

The sampling engine is controlled by 
a display client via a socket connection. 
The display client can tell the sampling 
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a: 6 
Introducing Objective Studio 


ith a family of more than a dozen products specifically designed 
\ N | for professional developers, Stingray Software is fast becoming the 


one-stop shop for the object oriented programmer — and with 
he new Objective Studio™ you can save thousands! 


3 Studios to Choose From 


JIbjective Studio Standard™ 
DbjectiveToolkit *« Objective Grid * Objective Chart 


Z 


Z| 








Objective Studio Professional 


Toolkit 

Grid 

Chart 
Toolkit Pre 
Visual CASE 


Dbjective Studio Professional™ oe 
DbjectiveToolkit * Objective Grid * Objective Chart * Objective Diagram 
Dbjective Edit * Objective Toolkit Pro * Visual CASE 


SOO TNAY 


Objective Studio Universal™ 

DbjectiveToolkit * Objective Grid * Objective Chart * Objective Diagram 
Dbjective Edit * Objective Toolkit Pro * Visual CASE 

Dbjective Toolkit/X * Objective Grid/CCE ¢ Objective Plug-in 
Dbjective Blend * Objective Grid/Java »* Objective Grid Enterprise 


Objective 
Toolkit 


This library of over 70 classes helps fill 
he holes in MFC. It provides docking 
windows, customizable ‘97 style toolbars, 
<xplorer style toolbars, and Outlook 
short-cut bars to switch between views. 
Toolkit includes a number of extended 
controls such as: time/date fields with 
Jrop-down calendar, currency fields 
with pop-up calculator, multiple column 
nulti-selection tree control, color wells, 
and dozens more! Toolkit Pro goes 
oeyond Toolkit to solve some of the most 
difficult MFC problems. 


= Objective 
Grid 


This full featured grid control is avalible 
in MFC, Java, and ActiveX versions. It 
is built on object-oriented technology for 
complete extensibility. It supports a 
wide variety of field types including: 
bitmaps, combo boxes, spin buttons, 
check boxes, radio buttons, push buttons, 
check boxes date/time fields, masked 
edits, and rich text edit fields. It also 
includes sophisticated features such as 
zoom, do/undo, print preview and Excel 
formula support. 











; Objective 
Chart 


Add charts to any CWnd based class 
including views, dialogs, or even controls. 
Over 20 charting types can be easily 
extended. Complete end-user customization 
of charts is provided. 







Objective 


Allows you to create “Visio™” style 
programs in just minutes. It creates a 
full OLE server application that allows 
your users to create objects from a 
palette using drag and drop. 


Visual 
Case 





The first CASE tool designed specifically 
for the Visual C++/MFC developer. It 
represents your source code in UML 
diagrams that allow you to get a bird’s 
eye view of your program. Visual CASE 
provides complete round trip engineering. 
It can generate Visual C++ source code 
or read Visual C++ source code into 
Visual CASE without data loss. 
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Objective 
Edit 





Lets you add a full featured multi-colored 
syntax editor in just minutes. You can 
color code keywords, comments, strings 
and text. 


Objective 
‘Blend 


The perfect toolkit for any Java developer. 
It adds modern MFC style controls to 
Java programs easily. It supports tabs, 
multiple column trees, calendar, calculator, 
spin buttons, progress meters, and slider 
controls. Fully supports Asymetrix’s 
Supercede, Visual J++ and JavaBeans. 





All products, except Visual CASE, 
include FULL SOURCE CODE and all 
products carry a 30-day unconditional 
money back guarantee. To find out 
more, visit our web site for demos, 
white papers, and evaluation copies. 
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Stingray Software Inc. 
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Figure 1: Block diagram of our Java performance ae: 


(continued from page 44) 

engine to start or stop profiling, change 
the sample frequency, transmit the con- 
tents of the buffer containing the trace 
data, and so on. Because we use a Stan- 
dard TCP socket, the display client could 
be written in any language; we chose to 
implement it in Java and execute it in a 
separate JVM. Furthermore, the display 
client can be run on a remote machine 
so that it will interfere minimally with the 
program we are profiling. 

As Figure 1 shows, our performance vi- 
sualizer has four parts— two in the dis- 
play client and two in the sampling en- 
gine. On the display side, there’s the GUI 
and its updater, which polls for perfor- 
mance data from the sampling engine 
across the socket. On the sampling en- 
gine side, we have the socket service code 
and interrupt handler. 


The Graphical User Interface 

The GUI is a Java applet written using 
lightweight components and the Java 1.1 
event model (http://java.sun.com/products/ 
jdk/1.1/docs/guide/awt/designspec/ 


VM Internal 


Native 





Figure 3 The OverallPieChart. 
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events.html). Figures 2—6 show the five 
possible frames users can bring up to 
look at the performance results. The 
“ConnectionFrame” in Figure 2 lets users 
control the connection to the JVM being 
profiled; we won’t discuss it further. 

The “OverallPieChart” frame in Figure 
3 contains a pie chart showing what frac- 
tion of the overall time is spent in Java 
code (user or library code), in supporting 
run-time routines of the Java virtual ma- 
chines (garbage collection, for instance), 
or in the underlying operating system’s 
kernel. Below the pie chart are buttons to 
activate the other frames. 

The other frames contain pie charts 
and lists. Figure 4 presents what’s prob- 
ably the most interesting frame for most 
Java programmers, the “Java Methods” 
frame, which is activated by clicking on 
the “Java” wedge in the OverallPieChart, 
or with the Show Java button. This frame 
shows how the time spent in Java methods 
is distributed among those methods. The 
pie chart on the left shows a wedge for 
each method, and the list on the right gives 
the method names. Both are sorted by to- 
tal time spent. All small wedges (<1 degree) 
are combined into the black “Other” wedge 





Machine narne: 


| Port number (TCP): 


Trying to connectto 
|puptent watson.ibm.com:6827..successful. 





Fiver 2: - The ConnectionFrame. 


in the lower right. The rest of the colors in 
the pie chart are arbitrary. From this pie- 
chart view, users can judge which Java 
methods are taking the most time. 

As the mouse cursor enters a wedge 
corresponding to a method, the wedge 
lights up and its information is displayed 
in the status bar. In addition, the corre- 
sponding entry on the list to the right is 
highlighted. If users left-click on a wedge 
or list entry, a menu pops up with options 
for further information on the method. Se- 
lecting Display Window from this pop-up 
menu, for example, brings up a Method 
frame; see Figure 5. 

The Method frame is analogous to the 
Java Methods frame, but corresponds to 
the breakdown by line number of the to- 
tal time spent in a particular Java method. 
The line numbers are obtained from the 
debug line table information in the 
method’s class file, so the class file must 
be compiled unoptimized for separation 
by line number to work correctly. 

Returning to the OverallPieChart frame, 
Figure 2, users can click on the VM wedge 
or select Show VM to produce the Vir- 
tualMachinePieChart; see Figure 6. This 
chart has the same form as the Java and 
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Method frames, but the contents list the 
C++ functions in our JVM. The names look 
strange because we simply print the man- 
gled function name created by the C++ 
compiler. This name includes an encoding 
of the function arguments, so users can 
discern the different overloaded functions. 


Sampling with WINMM 

At the opposite end of our visualizer sys- 
tem lies the program-counter sampling 
engine, which accumulates samples at 
frequent intervals during program exe- 
cution. The C++ code in Listing One 
(listings begin on page 105) accumulates 
a single sample. This code is a static call- 
back function that is called by the Micro- 
soft Win32 WINMM interrupt handler ev- 
ery time the WINMM interrupt fires. It 
creates a Win32 CONTEXT object to 
hold a thread context and then fills it by 
calling Win32’s GetThreadContext( ) 
function on the handle of the thread to 
be sampled. In this case, our JVM uses 
a single Win32 thread, called the “work- 
er,” to implement all the Java threads 
(TaskManager._workerThreadHandle.) 
Finally, the program counter register 
(Eip) from the thread context is stored 
in the ring buffer by the JavaProf.add() 
member function call. 

To ensure a low overhead, we want 
to minimize the amount of work done 
inside the callback function. The Java- 
Prof.add() function simply writes the 
given program counter into a preallo- 
cated ring buffer; it does no allocation 
or other work that might fail in long- 
running profiles, and it keeps the over- 
head down to a bare minimum. On the 
negative side, the GetThreadContext call 
is not trivial, because it is a Win32 call 
and has to go through the kernel, but 
that is a necessary evil. Fortunately, the 
GetThreadContext call is fairly quick. 

The method JavaProfiler::start() (see 
Listing Two) installs the callback function 
as a WINMM interrupt handler. The two 
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Visit our Web site at www.vedit.com and 
download a fully functional demo. 


Originally created in 1980 as the first 
programmer's editor, the new VEDIT PLUS 
5.1 has evolved into a unique multi-function 
tool that effortlessly handles all of your 
routine, tricky and huge file editing needs. 


Only VEDIT is fast enough for huge 100+ 
Megabyte files. It can perform thousands of 
search/replace operations in little more time 
than it takes to just copy the file. 





Written mostly in assembly language, it's 


Instant startup, instant everything. 


Advanced features include multi-file, multi- 
window editing, template editing, syntax 
highlighting, search/replace with regular 
expressions and columnar blocks. Fully 
configurable keyboard and emulations. 


VEDIT PLUS - Windows/DOS: $149. 
(Includes 32-bit Win 95/NT, 16-bit Win 3.1 


30-day money-back guarantee. Attractive 
site license pricing. Also VEDIT: only $89. 


Toll Free: 1-800-45-VEDIT (1-800-458-3348) 
Telephone: (734) 996-1300, Fax: (734) 996-1308 
E-Mail: sales@vedit.com, Web: www.vedit.com 
Mail: P.O. Box 1586, Ann Arbor, MI 48106 
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Figure 6: The VirtualMachinePieCbart. 


~ WOW! VEDIT can do that! 


ASCII, EBCDIC, Hex, 1000+ Megs, Database, CD-ROM, Mainframe, 
Postscript, C, HTML, editing, translating, converting, and letters 
to mom, the newVEDIT PLUS 5.1 forWindows/DOS does it all. 





"| love VEDIT. | replaced a 
difficult-to-maintain 3000 line 
text conversion program 
written in C with a 300 line 
VEDIT macro. Best of all, the 
VEDIT macro is faster." 

S. Krishna, Quadrant, TX 


Lightning fast multi-purpose tool 

@ Edit absolutely any file - text, data, or 
binary up to 2 Gigabytes. 

@ Edit in ASCII, Hexadecimal, Octal, 
EBCDIC or any combination of modes. 


@ Edit DOS, Unix and Mac text files, data 
files with fixed / variable length records. 


@ Translate between EBCDIC and ASCIl, 
IBM PC and ANSI. Flexible sorting. 


@ Perform numerous search/replace on 
entire groups (even thousands) of files. 


@ Powerful C-like macro language for 
writing custom translators, filters, more. 


@ Exceptionally compact. Easy to install. 


New VEDIT PLUS 5.1 features 

@ 32-bit version optimized for Win 95/NT. 
Attractive ANSI and OEM editing fonts. 
Convert newlines, smart detab/retab. 
Enhanced HTML and C editing. 


Improved word processing, searching, 
printing, “ctags”, and much more. 
VEDIT is a registered trademark of Greenview Data Inc. 





Greenview Data 
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WINMM functions called here are docu- 
mented in Microsoft’s Visual C++ docu- 
mentation. The timeSetEvent call installs the 
interrupt handler with a frequency equal 
to the desired sampling frequency or to the 
fastest sampling frequency supported by 
the WINMM timer, whichever is faster. (On 
Windows 95/NT, the fastest sampling rate 
supported is once per millisecond, which 
is adequate for our purposes.) 

After this call, the Win32 kernel begins 
calling the callback function at the spec- 
ified frequency and the buffer begins to 
fill with program counters. 


The Program-Counter Server 

The program-counter sampling engine is 
controlled over a socket connection by a 
client/server type connection. The server 
end sits inside our JVM. It is also written 
in C++; we have to access the C data struc- 
ture stored by the interrupt handler and 
call C functions for mapping the program 
counter. Therefore, coding all of it in C++ 
seemed simplest. In the usual fashion of 
socket programming, the server starts by 
listening for incoming connections on a 
specified TCP port; we arbitrarily picked 
the default to be 6789. 

When a client attempts to connect, the 
server binds the port and begins waiting 
for a command from the client. The pro- 
tocol between the client and server is sim- 
ple: First, the client sends the command 
(a single byte) followed by its arguments. 
Depending on the command, the server 
then responds to the client with the re- 
quested information. The client closes the 
socket when it is done with the service, 
and the server goes back to listening for 
incoming connections. 

The server supports control commands, 
such as start/stop profiling and change 
sampling frequency, along with data ac- 
quisition commands, such as “dump 
buffer” and “look up point.” Because the 
connection between the client and the 
server may be too slow to send thousands 
of points per second, the server supports 
a number of methods of reducing the 
amount of data sent. The server supports 
commands to not send certain fields, such 
as the method line-number field. It sup- 
ports a command to change the fre- 
quency of the WINMM interrupt, reduc- 
ing the number of points sent per second. 
It also can be told to ignore certain kinds 
of data points, such as all points in the 
kernel or all points not in Java methods. 

The most often used command is “get 
all data.” When the server receives this 
command, it decodes the program coun- 
ters in its ring buffer into type, function 
name, and line number Cif possible) and 
sends the decoded version over the sock- 
et connection. The decode process com- 
pares the program counter value to known 
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address ranges for the Win32 kernel, C++ 
code for our Java VM functions, and com- 
piled bytecodes from our JVM (our VM 
acts like a continuous Just-In-Time com- 
piler, converting all bytecodes to Intel in- 
structions). 

For the Win32 kernel functions, we 
don’t try to analyze the address any fur- 
ther; visualizing the performance of the 
Win32 kernel is not the purpose of this 
tool. For the internal VM functions we con- 
vert the address to a C++ function name 
using the .map file information generated 
by Visual C++ when we compiled the 
JVM. Finally, for addresses that map into 
regions of compiled bytecodes, we call 
functions in our JVM to produce the Java 


Several of the 
functions near the 
top of our profile 
come from garbage 
collection 





method name and line-number values. 
These functions did not have to be writ- 
ten explicitly for the profiler; they were 
already present for Java’s exception call- 
back trace generation. 


Updating the User Interface 

The last section of our visualizer lies be- 
hind the user interface on the client JVM 
side. This is the UI “Updater” that con- 
nects to the profile server and polls it, 
dispatching out the returned information 
to the appropriate windows opened by 
the user. 

Java’s 1.1 event model used in this part 
of the visualizer allows the number and 
kind of UI panels to be selected dynam- 
ically and for new information to be sent 
to all or just some of them. The event 
model relies on callback functions in Java’s 
EventListener interface. 

The coordinator of the event flow is the 
class ProfileUpdater, which has methods with 
names of the form addXXXListener() (where 
XXX is a kind of event). These methods 
build lists of EventListener objects passed 
to it as arguments. When the ProfileUpdater 
has new data, it creates a corresponding 
event object and passes it to each EventLis- 
tener registered with it. Each different kind 
of data will result in a different kind of 
event object passed to a different list of 
EventListeners. In this way we can add an 


arbitrary number of panels to the UI and 
update them efficiently. 

The ProfileUpdater has its own thread 
of control— that is, it implements Java’s 
Runnable interface. It acts as a client of 
the profiler engine, requesting data from 
the engine, converting the data into ap- 
propriate kinds of events, and then mul- 
ticasting the events to the corresponding 
list of EventListeners. Then it sleeps for a 
fixed time interval, wakes up, and repeats 
the polling loop. 


Where our VM Spends its Time 

With all of the components of our visual- 
izer built, we can run it by simply giving 
a command-line flag to our JVM to start 
the server during initialization. In a sepa- 
rate JVM, we start the client and connect 
to that server. Then, the polling/updating 
cycle starts profiling any Java program run- 
ning in the first JVM. 

As soon as we completed the first ver- 
sion of our profiler we discovered a one- 
line change to our Java VM that gave a 10 
percent speed up. At the top of the Vir- 
tualMachinePieChart we found memset(), 
the libc function usually used to clear mem- 
ory. Derek, our VM guy, mumbled some- 
thing unprintable when he saw the pro- 
file: He quickly recognized that clearing 
memory for every Java object created 
meant many calls to memset(). Moving the 
memset() call to clear open memory after 
garbage collection eliminated this cost. 

Of course, most of the revelations of a 
profiler are not so easy to act upon. Sev- 
eral of the functions near the top of our 
profile come from garbage collection. Our 
JVM uses a very simple copying collector; 
there are many better techniques but they 
all require more coding effort to implement. 
Other functions at the top indicate less se- 
vere changes. A couple of things we need 
to consider are avoiding string comparison 
for type checking in the VM and avoiding 
the creation of Java’s exception traceback 
string when the exception is caught. All of 
these represent obvious performance bot- 
tlenecks that were known when the code 
was written. Now we know which ones re- 
ally count and we can attack them. In ef- 
fect, our profiler creates a continuous “to 
do” list for optimizing our Java VM. 

Based on the VirtualMachinePieChart, 
we think we can increase the speed of 
our JVM code dramatically. At this point, 
the Java code would dominate, meaning 
that we should then put more effort into 
our bytecode-to-Intel instruction transla- 
tor. We know that much improvement can 
be realized here since we currently do 
none of the optimizations common in JITs. 


DDJ 
(Listings begin on page 105.) 
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Java gets back to its 
embedded-system roots 





Kelvin Nilsen 


mbedded computers are everywhere. 

In grocery stores, for example, they 

tally up your bill, gather consumer- 

profile information about your buy- 
ing habits, provide online access to your 
bank and credit-card accounts, and open 
the doors for you when you are ready to 
leave. In cars, embedded computers un- 
lock doors, roll down windows, adjust 
mirrors, control fuel injection, report 
speed, and monitor for situations that re- 
quire antilock braking or air-bag deploy- 
ment. At home, they control the circula- 
tion and temperature of air, microwave 
and convection ovens, dish and clothes 
washers, television sets, stereo systems, 
cable or satellite TV converters, and remote- 
control devices. Projections by companies 
such as Motorola, in fact, suggest that by 
the year 2000, the average number of mi- 
crocontrollers in a typical home will ex- 
ceed 200. Furthermore, the report “Soft- 
ware 2000: A View of the Future” UCZ, 
1994) projects that by the year 2010, there 
will be ten times as many software devel- 
opers working on embedded-system soft- 
ware as will be developing commercial 
software for use with desktop computers 
and servers. 


Kelvin is president of NewMonics and can 


be contacted at hitp://www.newmonics 
.con/ or kdn@newmonics.com. 
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Java is attractive to embedded-systems 
developers for many of the same reasons 
it appeals to desktop/Internet program- 
mers. Unlike C++, Java has a clean and 
simple design. Java is easy to learn and 
easy to use. It is object oriented and 
strongly typed. Java compilers prohibit 
many error-prone practices that are per- 
mitted in C and C++. Beyond the standard 
math, file handling, and text-formatting li- 
braries, Java also provides standard li- 
braries for Internet, multimedia, and multi- 
threaded programming. Finally, Java 


ee 
ase 2 
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provides garbage collection, a high-level 
service that automatically reclaims mem- 
ory for objects that are no longer in use. 
Even though Java development tools and 
compilers are still relatively immature, 
embedded-systems companies are already 
experimenting with this new technology. 
Engineers at a large telecommunications 
company recently reported they were 
twice as productive writing code in Java 
than in C++. Engineers from the same 





company reported that they had been dis- 
appointed with C++, in part because they 
had found it necessary to spend millions 
of dollars finding and fixing memory leaks 
in their C++-based telecommunications 
software. Another issue raised by these 
engineers was Java’s impact on human re- 
sources. They had found it difficult to re- 
cruit competent C++ programmers and 
subcontractors. Even more difficult was 
finding people skilled in this company’s 
in-house proprietary languages and tools. 
These engineers felt that it would be eas- 
ier to find skilled Java programmers, or, if 
necessary, to train existing technical staff 
in effective use of Java. 

The Portable Executive for Reliable Con- 
trol (PERC) is a clean-room implementa- 
tion of the Java Virtual Machine developed 
by NewMonics (my company). PERC is 
designed specifically for embedded and 
real-time programming applications. Some 
of the qualities of PERC that make it es- 
pecially well suited for these applications 
are its incremental defragmenting real-time 
garbage collector; a special ROMizer tool 
that allows Java bytecodes to be execut- 
ed out of ROM memory and supports 
ahead-of-time compilation of Java byte- 
codes to native-machine language; a Mini- 
mal Windows Toolkit that is approximately 
one third the size of Java’s Abstract Win- 
dows Toolkit; and optional real-time ex- 
tensions that allow real-time tasks to re- 
serve resources for periodic execution, a 
particular amount of live memory, and a 
particular rate for dynamic memory 
turnover. In supporting the real-time ex- 
tensions, PERC converts the various 
memory requests into a CPU-time re- 
quirement for garbage collection. Then 
it uses rate-monotonic analysis to deter- 
mine whether the workload representing 
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(continued from page 50) 
all application-level tasks combined with 
the garbage-collection task can meet the 
desired real-time constraints. Once CPU 
time and memory budgets have been 
granted, PERC enforces the budgets. Tasks 
are not allowed to run longer or allocate 
more memory than was budgeted. 

Shrinking the memory footprint is of 
prime importance in tailoring Java imple- 
mentations for embedded systems. For 
consumer electronics products shipped in 
the millions, every penny shaved from unit 
manufacturing costs adds up to a signifi- 
cant savings. Handheld devices must be 
small and lightweight. Battery- powered 
devices must operate on very low current. 
picoPERC is a Java subset in which the 
core Virtual Machine implementation fits 
in less than 64 KB of memory. This 64-KB 
footprint is nearly 1/16th the size of Java- 
Soft’s yet-to-be-defined EmbeddedJava 
and over 50 times smaller than typical En- 
terprise Java implementations. 

picoPERC consists of a core virtual ma- 
chine upon which optional extensions can 
be layered. Unlike standard Java, the pico- 
PERC core lacks support for floating-point 
numbers, 64-bit integers, automatic 
garbage collection, the standard Jang, util, 
io, and net class libraries, and dynamic 
bytecode loading and verification. All of 
these capabilities can be added to the pi- 
cOPERC core under programmer control. 

With so much of standard Java missing 
from the picoPERC core, you might won- 
der which, if any, of the traditional Java 
benefits are supported by picoPERC. The 
main benefit is that the source language is 
the same as standard Java. This means that 
if you are developing very small, deeply 
embedded systems, you can use the same 
language and development tools you use 
for larger embedded systems. Also, this 
means that picoPERC programmers bene- 
fit from the simplicity and consistency of 
the Java language design, and from the 
strong type checking performed by Java 
compilers. Another benefit of picoPERC is 
that it uses Java’s portable bytecode rep- 
resentation. This means that portable soft- 
ware components can be developed or de- 
livered by multiple independent suppliers 





Figure 1: The visual Black Jack 
program. 
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and integrated in binary form, without re- 
quiring source code access for system in- 
tegration. Finally, because picoPERC uses 
the same byte code representation as tra- 
ditional Java, it is possible to run, evaluate, 
and debug picoPERC programs using tra- 
ditional Java Virtual Machines and integrated 
development environments. Once the soft- 
ware development effort is done, the 
ROMizer tool can be used to target the byte- 
code representation to ROM memory for 
execution by the picoPERC Virtual Machine. 


Black Jack Application 

Several months ago, a customer asked us 
to assist with the development of an ap- 
plication that could demonstrate Java’s 
benefits to management in order to ob- 
tain funding approval for development of 
a larger application. The sample applica- 
tion would have to run on existing hard- 
ware, which limited the RAM memory to 
one MB, restrict output to a two-line by 
32-character display, and take all input 
from a numeric keypad. We wrote the 
Black Jack program, which Ill present in 
this article. The source code for the pro- 
gram is available electronically (see “Re- 
source Center,” page 3). 

The program acts as a Black Jack deal- 
er for a single human player. Before the 
game begins, the dealer asks how much the 
user would like to bet on the game. The 
following interaction indicates that the user's 
account has $95, of which the user bets $5. 


[95]Bet: 
5 


The computer starts the game by deal- 
ing two cards for itself and two for the 
player. One of the dealer’s cards is face 
down. All other cards are face up. The fol- 
lowing display indicates that the dealer 
has one face down card and one Ten of 
Hearts. The player has a Nine of Dia- 
monds and Ace of Clubs. 


D> * TH 
P> 9D AC 


The dealer waits for the user to request 
an action. The user types “s” to stay with 
the current hand, or “h” (for hit) to request 
an additional card. Once the user types 
“s,” the dealer begins dealing additional 
cards to itself. In this case, the dealer also 
stays, resulting in a tie game. This is rep- 
resented as: 


D Ties> QH TH 
P Ties> 9D AC 


Figure 1 shows a graphical version of 
the Black Jack program. 


Implementing Black Jack 

The main functionality of Black Jack is im- 
plemented by the following object (class) 
definitions: | 


e Card represents each card of the deck 
in terms of suit and rank. 

e Deck represents a shuffled deck of 52 
cards and provides methods to begin a 
new game and deal a card. When a new 
game begins, all cards dealt in the pre- 
vious hand are added to the deck’s pile 
of used cards. If the shuffled deck is 
empty when a new card is to be dealt, 
the pile of used cards is reshuffled and 
the next card is dealt from the result. 
Hand represents the cards dealt to a 
particular player. The toString() method 
returns a string representation of the 
cards held in the hand. The dis- 
playCards() method causes the cards 
to be drawn on a GUI device. This 
method did not exist in the first imple- 
mentation of Black Jack; it was added 
to the program when we ported Black 
Jack to an embedded system that sup- 
ported a 600x800 pixel flatpanel VGA 
screen. The addCard() method adds a 
new card to the hand. The calcValue() 
method returns the value of the hand. 
The empty() method discards all the 
cards in the current hand, in prepara- 
tion for a new game. The hide() and 
reveal() methods control whether the 
first card in the hand is face down. 


Unlike PERC and Java, the picoPERC 
core does not support automatic garbage 
collection. Thus, it is necessary to explic- 
itly delete each object after its useful life- 
time has expired. This Black Jack program 
is written as a persistent embedded pro- 
gram. Black Jack starts when the embed- 
ded computer is powered up. It contin- 
ues to run until the computer is powered 
down. There is no need to ever reclaim 
the main data structures of the program. 
Black Jack allocates 52 Card objects to 
represent the cards of a standard deck. 
These cards are never recycled. They are 
simply moved back and forth between the 
deck, the dealer’s hand, and the player’s 
hand. The only dynamically allocated ob- 
ject that must be explicitly reclaimed is the 
string created by Hand.toString(). Note 
the invocations of MM.delete() in the 
showPlayer() and showDealer() methods 
of the BlackJack class. 

To reduce memory footprint, picoPERC 
does not include all of the standard library 
services that are built into standard Java. 
Since these services are redundant with 
the services provided by many embedded 
and real-time operating systems, the ser- 
vices can be easily implemented using na- 
tive methods. A PERC native method is a 
method implemented by a C program. In 
the Black Jack program, native methods 
implement keypad input, LED output, 
graphics output, and random number gen- 
eration. The interfaces following represent — 
these services: 
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e Input is an interface that describes the 
getBet() and getAction() methods. These 
methods are used to determine how 
much money the player wants to bet 
and whether the player wants to receive 
another card. 

¢ Output is an interface that describes the 
clearCardArea(), drawCard(), and 
print) methods. clearCardArea() eras- 
es all of the cards that had been previ- 
ously drawn. drawCard() draws a sin- 
gle card at a selected position within 
the card area. print() displays a single 
integer or a single string of characters 
in the text window. 

e RandGen is an interface that describes 
generate(), a method that produces a 
random integer number. 


These are represented as interfaces be- 
cause we want to provide different im- 
plementations of these services, de- 
pending on execution environment. All 
of the code can be run either on a tra- 
ditional Java Virtual Machine or on a 
picoPERC Virtual Machine. 

For development, testing, and debug- 
ging, we used KeyboardViaJava to im- 
plement Input, ASCIIViaJava to imple- 
ment Output, and RandGenViaJava to 
implement RandGen. These classes im- 
plement the required services using stan- 
dard Java libraries. This is portable and 
convenient, but it adds considerably to 
the size of the application because it re- 
quires inclusion of the standard Java li- 
braries. Once the basic functionality of 
the application had been tested and de- 
bugged, we ported the code to the small 
LED embedded device by replacing these 
classes with KeyboardViaNative, ASCII- 
ViaNative, and RandGenViaNative, re- 
spectively. 

These three classes use native meth- 
ods, which are not shown, to implement 
the various services. The ASCI/ViaNative 
methods assume the output device sup- 
ports only text output. The correspond- 
ing clearCardArea() and drawCard() 
implementations are empty, meaning 
that there is no attempt to display any 
graphics on this device. 

Later, we ported the code to the VGA 
flatpanel device by modifying the im- 
plementations of KeyboardViaNative and 
RandGenViaNative, and by replacing 
ASCIViaNative with class GUI. The ap- 
propriate implementation of native meth- 
ods depends on the underlying hardware 
and the services that are provided by the 
host operating system. 


Preparing the picoPERC ROM Image 

After the Black Jack program was test- 
ed, we prepared a version to be placed 
into the embedded device. First, we com- 
piled the various Java source files (also 
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Table 1: Memory layout. (*The in-memory sizes of Java classes are 
approximated by scaling individual class file sizes proportional to the ratio 
between the sum of class file sizes and the size of the ROM memory image.) 
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available electronically) using the javac 
program supplied in JavaSoft’s JDK 1.0.2: 


javac BlackJack.java Deck.java 
Hand.java Card.java 

javac Input.java Output.java 
RandGen. java 

javac KeyboardViaJava.java 
ASCIIViaJava.java 
RandGenViaJava. java 


We used our ROMizer tool to preload 
the various class files. The ROMizer re- 
solves symbolic references and replaces 
certain byte code instructions with more 
efficient representations. The command- 
line invocation: 


romize -f %perc%\images\ 
picopere classes 
-f BlackJack.classes 


invokes the ROMizer, telling it to preload 
all of the classes identified in the 
%perc%\images \picoperc.classes and 
BlackJack.classes files. %perc%\ images \ 
picoperc.classes names the files that repre- 
sent the core picoPERC classes. Included 
among these are simplified definitions of 
java.lang.Class and java.lang.String and 
certain exceptions that may be thrown by 
the Java interpreter. BlackJack.classes names 
the classes that represent the Black Jack 
demo. For each loaded class, the ROMizer 
automatically loads all classes that are di- 
rectly referenced by the class if those class- 
es have not already been loaded. 

We compiled and linked an earlier ver- 
sion of the Black Jack program for the 
ARM processor running on a Pixel Press 
board supplied by Applied Data Systems 
(http://www.flatpanel.com/) and ana- 
lyzed the resulting memory image. Table 
1 describes the memory layout. The 
subtotal is 4344 bytes larger than the ac- 
tual size of the load image. This is be- 
cause the subtotal was calculated by sum- 
ming individual object file sizes, whereas 
the total load image size is the size of the 
linked program. Certain information is 
discarded from the object files in the pro- 
cess of linking the various files together. 
We are currently working on a number 
of optimizations designed to shrink the 
in-memory sizes of the Java classes by 
approximately 50 percent. 
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EMBEDDED SYSTEMS 








Building Intelligent, 
Web-Based Control 





Embedding the 
EMIT web server 





Tom Milligan and Steve Coffin 


s engineers living in an arid climate, 
we've always been annoyed by 
“dumb” sprinklers that come on at 
ridiculous times, such as in the mid- 
dle of a hot, sunny day when much of the 
water is lost to evaporation, or while it’s 
raining and totally unnecessary. Sprinklers 
have primitive interfaces and are limited 
by when, where, and how they can be 
programmed. 

Therefore, we decided to automate a 
sprinkler system using the Embedded Mi- 
cro Internet Technology (EMIT) toolkit 
from emWare (the company we work for). 
EMIT is a toolkit for embedding web 
servers on devices, and developing web- 
based user interfaces. In this article, we'll 
describe our system design, explain how 
we built the device, and show how we 
developed the interface. 





EMIT Background 
EMIT includes five modular software com- 
ponents: 


e emMicro, a compact, special-purpose 
server. 


Tom and Steve are engineers at emWare Inc. 
They can be reached at eng@emware.com. 
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e emNet, a message-based protocol that 
combines packet and stream interchange. 

e Microtags, preprogrammed packets that 
define device controls such as switch- 
es, buttons, LEDs, and so forth. 

e emGateway, which expands microtags 
into their full parameters. 

e emObjects, a library of JavaBean com- 
ponents consisting of visual and utility 
objects. 





These components work together to dy- 
namically create the user interface for an 
electronic device. Figure 1 shows the EMIT 
architecture. When a user requests the de- 
vice interface from a web browser, the 
browser sends the request to emGateway. 
emGateway then translates the high-level 
user request and sends it to emMicro at 
the device. 

emGateway requests from emMicro an 
HTML page containing “Microtags,” con- 


densed tags that contain the information 
necessary to represent a specific device 
control (a slider, button, LED, or whatev- 
er). Condensing this information improves 
the space, resource, and bandwidth re- 
quirements of the device. 

After emMicro returns the HTML page 
with Microtags, emGateway substitutes 
each Microtag with a corresponding 
emObject. emObjects are JPEG/GIF im- 
ages or Java applets that represent device 
controls to the browser. The page with 
the substituted emObjects is then sent to 
the browser. 

EMIT components process user requests 
to view and set device information, dy- 
namically representing the results to the 
user from within the browser at the desk- 
top. For example, users may ask to flip a 
switch from “off” to “on.” Once a user 
makes the request, the browser invokes 
emGateway. Then, emGateway sends the 
request to emMicro at the device. 

When emMicro receives the request, it 
causes the device’s microcontroller or 
microprocessor to perform the request 
(flip the switch). Because device infor- 
mation has changed, the state of the em- 
bedded device changes, which will result 
in a state-change message being sent to 
any interface emObject (component) mon- 
itoring affected variables. 

The emMicro web server is about one 
KB, and the sprinkler controller applica- 
tion is about 900 bytes. If we had to build 
physical interface logic onto the device 
(the LCD, buttons, and so on), an addi- 
tional two to four KB of program space 
would have been required. The majority 
of the programming, however, is done 
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(continued from page 50) 
on the client side, so hardware resource 
requirements are kept to a minimum. 


Specifications 

The ideal sprinkler system (see Figure 2) 
would let you sit down at a computer and 
water the lawn from a Java-enabled 
GUI— at anytime, from anywhere. In ad- 
dition, the system should be able to wa- 
ter proportionately; that is, less in spring 
when it’s raining all the time anyway, and 
more in the dog days of July and August. 

Our system could also check the Na- 
tional Weather Service’s web site for pre- 
cipitation information. That way, you could 
program your sprinklers to come on or stay 
off based on one-, three-, or five-day fore- 
casts. Links to temperature sensors and/or 
moisture probes might also be available as 
alternatives or in addition to National 
Weather Service information. Either way, 
you could easily program your sprinklers 
to water in the predawn hours before hot 
days of continuous sunshine, and not to 
water when rain is the order of the day. 
When your computer is turned off and the 
sprinkler system isn’t able to access either 
the Internet or stored information con- 
cerning soil moisture, the system simply 
defaults to its regular schedule. 

On days when the predictions are 
wrong, you can pull up the sprinkler in- 
terface and reset the sprinklers accord- 
ingly. The sprinkler interface allows for 
customized usage in other ways. For ex- 
ample, you can turn off the system for a 
few hours or a few days, or to accom- 
modate a hard, driving rain. Also, forget- 
ful users don’t have to reset the system to 
prevent burning their lawns; the sprinklers 
have enough intelligence to turn back on 
after the allotted time according to their 
regular schedule. 


Building the Device 
We built the intelligent sprinkler system 
using about $15.00 worth of garden-variety 
components. Figure 3 is the schematic for 
the device. The microcontroller is the 
AT89C2051, a 20-pin 8051 derivative from 
Atmel with two KB of program space. (For 
more information on programming 
AT89C2051, see “Atmel’s AT89C2051 
Microcontroller,” by Dhananjay V. Gadre, 
DPJ, July 1997; available electronically, 
see “Resource Center,” page 3.) We in- 
stalled an RS-232 transceiver to provide 
external communications. We wrote some 
code to build a real-time clock and cal- 
endar, and an alarm clock that would se- 
quence throughout the program and man- 
age all the various watering start times. 
We included the emMicro module to 
provide a remote interface for manually 
programming the system and customiz- 
ing the watering schedules. We included 
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(continued from page 58) 

a serial EEPROM device to store interface 
documents and program times in a non- 
volatile array, just to make sure that if the 
device loses power, it will retain essential 
information. In addition, we included a 





Figure 2: The sprinkler controller system. 
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vices. We used a 74hc138 decoder to al- 
low eight valves to be controlled using 
three lines from the microcontroller. And 
we made provisions on the schematic for 
an additional decoder that can provide 
control for as many as 16 valves. 

Figure 4 shows the programming logic 
for this device. The real-time clock/calen- 
dar for the sprinkler controller was im- 
plemented completely in software (see List- 
ing One; listings begin on page 70) and 
uses as its time base the microcontroller’s 
18.432MHz crystal. The clock has two main 
software components: an interrupt-service 
routine, and the clock routine itself. The 
8051’s hardware timer 0 is used to execute 
the ISR once every ten ms. The ISR must 
reset the hardware timer for the next in- 
terrupt and account for interrupt latency, 
which would cause the clock to drift off. 
Unfortunately, the 8051 architecture doesn’t 
support a 16-bit auto-reload mode, which 
would make it unnecessary to correct for 
interrupt latency; fortunately, this is not 
hard to correct. 

Because t0lo is small (zero), there is no 
real possibility of a rollover when the ad- 
dition is done, and therefore, no reason 
to account for a carry into the upper por- 
tion of the counter. The most important 
thing the ISR does is to set the bit vari- 
able called temMSbit. This is used to tell 
the real-time clock routine that a time tick 
has taken place. 

The other software component of the 
clock is the routine responsible for count- 
ing all the ticks created by the ISR. Basi- 
cally, the routine just counts up hours, min- 
utes, seconds, and days, and sets a few 
bits here and there to tell other routines 
when to operate. For instance, once every 
minute, the bit variable CheckWater is set, 
so that the alarm-clock routine knows to 
check all the scheduled watering times to 
see if any are scheduled to begin water- 
ing. There are two-day counters, one used 
for watering schedules based on the days 
of the week (Monday, Tuesday, Wednes- 
day, and so on), and the other for peri- 
odic watering (every other day, every third 
day, every fourth day, and so on). 

The counting routine (Listing Two) is 
called by the main calling loop in a 
round-robin multitasking methodology. 
It is called much more frequently than 
the ten-ms rate at which the timer ISR 
takes place, so it is sure not to miss a 
tick. Whenever the ISR sets the tenMSbit, 
the clock routine increments its clock reg- 
isters as necessary. It then clears tenMS- 
bit so that it doesn’t respond more than 
once to each tick. 


Programming the Controller 

There were three main steps involved in 
integrating the emWare software into our 
application: 
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(continued from page 60) 

1. Include the emMicro code module in 
the application. 

2. Create the resource tables. 

3. Insert calls into the entry points of 
emMicro. 


emMicro is usually included by placing 
an include statement after the application 
code. This tells the assembler or the com- 
piler to include the emWare code when 
building the executable. 

The resource tables are where all of the 
information about the application is spec- 
ified for the browser interface. Creating 
the resource tables really means editing 
the example tables included with EMIT to 
match the application’s needs. For exam- 
ple, the sprinkler controller primarily uses 
the Variable Table to make a large num- 
ber of variables— such as Hrs, Min, and 
Sec—visible to the web-browser interface. 
These tables tie the browser interface to 
the microcontroller application. Listing 
Three presents a small section of the Vari- 
able Table. 

The structure of each table is similar. 
The first byte is the number of elements 
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in the table. Our sprinkler controller uses 
7 variables, so the first byte of the sprin- 
kler controller’s variable table is 37. Next, 


MIT components 
process user 
requests to view and 
set device 
information 


there are two bytes to describe the type 
of each variable (byte or word), and 
whether the variable is read only or 
read/write. Finally, the variable names 
are placed into the table in the form of 
NULL-terminated strings. There are oth- 
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er tables that specify Events, Functions, 
Static Documents, and Capabilities. Once 
these tables are created, they are in- 
cluded at the end of the application 
source code either by using include 
statements or by simply directly includ- 
ing the tables. 

Placing calls to the emMicro entry points 
is straightforward, as there are only two, 
but there are some rules applications have 
to obey. One entry point is the serial port 
interrupt vector entry. emMicro has its own 
serial ISR, but the vector has to be installed 
into the microcontroller’s main vector table. 
The other entry is the main emMicro entry 
(called emMicroEniry). | 

For an application to coexist with em- 
Micro, the application must be written 
using a cooperative multitasking method- 
ology. In other words, each process, in- 
cluding emMicro, is called by a main call- 
ing loop, and each process must respect 
all the others by releasing the process 
control back to the calling loop in a time- 
ly fashion. Just what “timely” means de- 
pends on how often each process really 
needs to have the attention of the pro- 
cessor. emMicro needs to be called at least 
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(continued from page 62) 
often enough to receive each character 
from the serial port. 

In any case, no process should ever 
stall the processor in a tight loop waiting 
for some event to happen. That’s what in- 
terrupts are for. The only other rule is that 





no process should get exclusive use of 
the CPU registers. Each process can use 
them without regard to their previous 
state. Most experienced programmers 
abide by these rules anyway. Listing Four 
is the main calling loop for the sprinkler 
controller. 


Example 1: Creating a new emitjri object. 





Example 2: Establishing the connection between the image switch waterOnOff 


and the embedded variable Water. 


#include <iostream.h> 


class Animal 
{ public: 


}; 


class Cat 
{ public: 
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class Dog : 
{ public: 
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int main() 
{ Animal *p 
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: public Animal 
char *speak(int) 


public Animal 
char *speak(short) { return "bow-wow"; } }; 


for C/C++ 


Version 7.5 
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virtual char *speak(int) { return "silence"; } 


{ return "meow"; } }; 


new Dog; cout << p->speak(0); return 0; } 


ee 


The speech of dogs is supposed to be 'bow-wow' but instead the program prints ‘silence’. 
Why did the dog not bark? Do not be misled by the famous Sherlock Holmes mystery where 
the dog that did not bark was a 'friend' of the criminal. No reflection on our major media is 
intended either. Call if you need a hint or visit our web site at www.gimpel.com. 


There are several advantages to de- 
signing the application in the aforemen- 
tioned manner. For instance, the clock is 
set from the browser; not a single line of 
code in the microcontroller application 
has to be concerned with how the clock 
is set. Also, all the configuration parame- 
ters are adjusted without any code in the 
microcontroller. Forcing the browser to 
display new information (or fancy graph- 
ics) is as easy as changing the value of a 
variable in a resource table. If the micro- 
controller application calls for a special 
function to be executed when a button is 
pushed on the browser interface, you only 
need to write the code for the function; 
emMicro takes care of executing the func- 
tion for you. 


Developing the Interface 

Little code was needed to interface to the 
controller, since EMIT provides prebuilt 
emObject interface components and han- 
dles the communication details. To fur- 
ther simplify our task, we used Syman- 
tec’s Visual Café, and integrated our 
emObject components with their exist- 
ing component toolkit. This way, we 
could select the objects from the com- 
ponent palette, add them directly to the 
applet, and easily manipulate layout and 
component properties. 

The heart of the applet is an invisible 
object called emitjri, which is the Java 
Runtime Interface for EMIT. Example 1 
creates Sprinkler1, a new emitjri object 
configured to communicate with our 
sprinkler controller, which has a Manu- 
facturer ID of four and a Device ID of 
11. All emObjects are passed a reference 
to the emitjri object. This allows emOb- 
ject events to directly set variables on the 
embedded controller and allows changes 
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so you can be sure your product is correct by design. using Rapid PLUS™, or for more information, 


Then, Rapid PLUS™ can automatically and concurrently —_ contact us. 
generate compact and efficient C or G++ code, 


™ 
Rapid PLUS is a registered trademark of Emultek. E mM U Ite k 


Call 1-800-EMULTEK Web: www.emultek.com e-mail: info@emultek.com 
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J 
[he world’s fastest aaa 
to directly set and get embedded vari- 
able values, and also allow it to register 
} T t a callback to be notified when an em- 
S S CTY?) bedded variable value changes. 


An example of an emObject directly 
controlling an embedded variable is our 
watering on/off image switch. This object 
displays one of two images depending on 
its state, which is active or inactive. Ex- 
ample 2 establishes the connection be- 


is also the easiest to use. 


Traditional software configuration management systems tween the image switch waterOnOff and 
force you to work at their speed. That's fine, if you like the embedded variable Warer. This means 
running the high hurdles in fishing boots. Perforce frees 


that whenever an ActionEvent occurs 
(when the switch is pressed and released), 
you to work at your speed. the event value (True or False) is used to 


We gave Perforce its nimble user interface and its set the embedded variable Water. 


lichtine-f i; | ee Ait The embedded controller clock is a 
ighting- ast response because we love simplicity and hate good example of communicating variable 


waiting. And because we believe that nothing should get state change to the display applet. The 
between you and your work. embedded variables Hrs, Min, and Sec 

change regularly, and these changes need 
to be relayed to the applet. Example 3(a) 
sets up the callbacks for the embedded 
variables. Whenever the variable values 
change, the function notifyChange, which 


THE FAST SCM SYSTEM is defined in the object this (the applet), 
is called with the variable name and the 
info@perforce.com Wwww.perforce.com new value; see Example 3(b). Similarly, 


PUREST ERG EVeW Gece Omer NE tc Gr- We CME EGOs sr eyeiem | we could have omitted the notifyChange 
callback for Sec by setting a direct link be- 


tween the Sec variable and the emObject 
timeDisplay, with the function call in Ex- 
ample 3(c). 
M ® Since emitjri handled our communica- 
ONOTYPE tions requirements, the majority of our 
coding effort was spent defining event- 
handler functions. Whenever a button is 
pressed, text is entered, list items are se- 
lected, checkboxes are checked, and so 
on, events are generated. The event han- 
dling functions for 24/12 hour clock, 
AM/PM, and day select are characteristic of 
many of the event-handling functions that 
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Figure 5: The sprinkler user interface. 
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(continued from page 60) 
we had to create. Listing Five presents 
these functions. 

We built many features into the user in- 
terface that could not easily be accom- 
plished with a hardware solution, or by 
defining the interface completely within the 
embedded controller. This expanded the 
size of our applet (approximately 75 KB), 
but reduced the controller and support in- 
terface hardware costs while increasing flex- 
ibility. The end result is a Java class file that 
we could compress onto a 32-KB EEPROM 
and serve from the device. We could also 
let emGateway serve the client the prestaged 
class file along with the standard set of pre- 
installed emObjects. Figure 5 shows the 
completed user interface. 


For More Information : : 
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Example 3: (a Setting up the callbacks for the embedded variables, (b) the 
notifyChange function; (c) setting a direct link between the Sec variable and the DDJ 
emObject timeDisplay. (Listings begin on page 70.) 
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EMBEDDED SYSTEMS 


Listing One 


, 


I i a 


ChPb bee SPBEESESSSOSSELESDESSSSES 


SEXKKKKKKEKKKKKEKKK EKER Timer® Interrupt Handler ***#XKEKKEAEKEKKKKKKKEK 
SSSA ISA SSC IC ACI IR IO ICR ICR ICRC ICR 1 3. 3 1 2 221 2 a a a aK aR ok oo oo oko ok a oR ak ok ak ak 
; emWare doesn’t use or require the use of the timer interrupt. 

; System clock is 18.432Mhz 

; One machine cycle is 1/12 system cycle or .65104us 

; Timer @ counts up once every machine cycle. 

; Timer will fire an interrupt when it counts up and rolls over to 9. 

; We want a 10 ms interrupt rate .@10/.65104 = 15360 machine cycles. 

: = 3c00 hex 

; 10020 - c30G = c40O 
; So t@hi = Oxc3 and t@lo = @ 

INT_T@ push PSW 


push ACC 

Mov THG, #t@hi  ;Load timer reload register 

mov A, #t@low 

add A, TLO ;Account for interrupt latency 

mov TLO, A ;by adding preload value current value 
setb tenMSbit ;set this bit once every 10ms 

setb GreenLED ;Turn off green LED 

pop ACC 

pop PSW 

reti ;Return from Interrupt 


Listing Two 
doo daca aaa aera: Real Time Clock i aR aR ORR 


;Once every 10ms the timer ISR sets the tenMSbit. 
suse this routine to update the real time clock 


RealTimeClock 


jnb tenMSbit,RTCdone ;Just exit if temMSbit isn’t set 
clr tenMSbit ;only count the tick once 

inc Hundredths 3100 ticks to 1 second 

clr A 

mov Ri, Hundredths 

cjne Ri, #190, RTCdone ;Has one second elapsed? 

mov Hundredths, A 


ine Seconds ;Yes increment seconds 
setb waterbit ; 

setb ManSecBit 

mov Ri, Seconds 


clr GreenLED ;Turn on green LED 
;ISR will turn it off in 10ms 
cjne Ri, #68, RTCdone ;Has one minute elapsed? 
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mov Seconds, A 
inc Minutes ;Yes, increment minutes 
mov Ri, Skip 
cjne R1, #0, RTCskp ;Don’t check if Skip is not zero 
setb CheckWater ;See if it is time to start watering 
RTCskp mov Ri, Minutes 
cjne Ri, #60, RTCdone ;Has one hour expired? 
mov Minutes, A 
inc Hours ;Yes, increment hours 
mov Ri, Hours 
cjne Ri, #24, RTCdone ;Has one Day expired? 
mov A, Skip 
jz RTCnoskip ;is the skip watering variable zero? 
dec Skip ;decrement skip-watering variable 
clr A 
RTCnoskip mov Hours, A 
inc Days 
mov Ri, Days ;Has one week expired 
cjne Ri, #7, RTCmod 
mov Days, A 
RTCmod 
inc DayMod ;Increment modulus 120 day counter 
mov R1, DayMod ;Have 120 days expired? 
cjne Ri, #120, RTCdone 
mov DayMod, A 
RTCdone 
ret 


Listing Three 


;*** variable table send in response to GetVarSymbols request from host **** 


EMvartable .byte 37 ;= # of variables 
.byte @ ; ATTRIBUTE byte for each variable 
.byte BYTETYPE+READWRITE ; TYPE byte for each variable 
.byte @ 
.byte BYTETYPE+READWRITE 
.byte @ 
.byte BYTETYPE+READWRITE 


.text “water \0@0” 
.text “DayMod\000” 
.text “Days\000” 
.text “Hrs\@00” 
.text “Min\@00” 
.text “Sec\@00” 


;Null terminated string for each variable 


Listing Four 


Dc eo ica Main Idle Loop xa x xa aa KR a CR a A RR a a AR RR 
; Cooperative multitasking main calling loop. Each process should release the 
; processor whenever possible. No stalling allowed. User and em processes 











s should be replaced with lcalls. emMicroEntry must be called 
e per character tx time (at 9600 baud this is approx once per 96@us ) 


acall emMicroEntry ;Emware app layer also calls Link layer 
acall debug ;general purpose debugging stuff 

acall RealTimeClock ;Update the RTC when necessary 

acall Water ;Water the grass when appropriate 

acall CheckTime ;Check if its appropriate to water 
acall ManualCntrl ;allow for manual control 

ajmp main ;loop forever 


Dee: Main Tdle Loop End 333 GH GGG ECBO IRI IR A IK 10K 108 


‘ ’ e 
Listing Five 
/ Select 24 hour clock mode 
void hr24RadioButton_Action(Event event) { 
// get the value of the embedded variable “State” 
Object object = emJri.getJSJriVariable(“State”) ; 
int state; 
// disable am/pm buttons 
amRadioButton.enable(false) ; 
pmRadioButton.enable(false) ; 
isClock12 = false; 
state = ((Integer) object) .intValue() ; 
* instead of using a separate variable (wasted resource) to hold the clock 
mode set bit @ of the “State” variable to indicate 24 hour clock mode. */ 
enJri.setJSJriVariable(“State”, 
new Integer(state | @x@@01)); 
set_hours(); // update clock 
// change the clock to am 
void amRadioButton_Action(Event event) { 

* reset the embedded variable “Hrs.” Device runs in 24-hour mode so simply 
subtract 12 from the current hour. Since there is a JriLink to this 
variable “notifyChange” will be called and complete the clock update */ 

emJri.setJSJriVariable(“Hrs”, new Integer (hours-12)); 

} 

// set the current day 

void dayChoice_Action(Event event) { 
int day = dayChoice.getSelectedIndex() ; 

‘x set the current day from @ - 6 depending on the selected choice item */ 

emJri.setJSJriVariable(“Days”, new Integer(day)); 
dayLabel.setText (dayNames [day] ) ; 
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Transparent ATL 


Controls 





Add excitement to your 
web site with this 
ActiveX control 





Tom Armstrong and Mark Nelson 


isplaying transparent GIF files on 
web sites using standard browsers 
has become a popular activity. Un- 
fortunately, Windows 95/NT doesn’t 
provide much native support for the dis- 
play of images with transparent compo- 
nents. In this article, we'll use Microsoft's 
Active Template Library (ATL) to present 
an ActiveX control that displays a bitmap 
with a single transparent color. The con- 
trol can be used with Internet Explorer, 
Netscape Navigator (with the ScriptActive 
plug-in), Visual Basic, and most other Ac- 
tiveX containers. 

When referring to bitmapped images, 
“transparency” refers to an image that has 
an arbitrary number of transparent areas. 
When the image is drawn, the transpar- 
ent areas don’t obscure the area behind 
the image in the z-order. In the simplest 
case, you’ve seen this used on the Web, 
producing images of complex objects (like 
Figure 1) that appear to be floating on top 
of a background. 





Tom is the author of The Active Template 
Library: A Developer’s Guide (MET Books, 
1997). He can be contacted at tom 
@widgetware.com. Mark is the author of 
The C++ Programmer's Guide to the Stan- 
dard Template Library (DG Books, 1995). 
He can be contacted at markn@tiny.com. 
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Sites like Figure 1 typically have a stan- 
dard background pattern that shows up 
on many or all of their pages. The GIF on 
the left side has navigation bars that are 
used in an image map. A GIF file with the 
site logo is on the right. Both images fea- 
ture transparent areas that let them blend 
in with the background. 





How do They do That? 
After seeing this effect on the Web, we set 
to the task of duplicating it in an ActiveX 
control. We discovered a couple of dif- 
ferent ways to implement transparency in 
a control; the most straightforward requires 
you to set up a windows HRGN object, 
used to define a nonrectangular drawing 
area. Figure 2 is an example of a draw- 
ing region. As long as you can define your 
region as a path connecting a series of 
points (or elliptical regions), you can cre- 
ate a drawing region that Windows un- 
derstands. 

Once you have a drawing region de- 
fined, transparency is easy. In the simplest 
case, you take advantage of the Windows 





95 ability to define a nonrectangular win- 
dow. That done, you draw the window 
as you normally would. The areas outside 
the region are drawn by whatever resides 
behind the window. 

Defining a drawing region works well 
with containers that implement Microsoft's 
OCX 96 specification (which ships with 
the ActiveX SDK). In the best case, you 
can implement two-pass drawing, which 
lets you draw the foreground of your con- 
trol first, then lets the container draw the 
background. This provides fast, flicker- 
free drawing of controls. 


Ointment Ready, Enter Fly 

Defining drawing regions is great for some 
applications, but might not be a general- 
purpose technique. We wanted to be able 
to draw any type of image with randomly 
configured transparent areas. This means 
having transparency controlled on a pixel- 
by-pixel basis. Attempting to define a re- 
gion for any arbitrary bitmap is simply ask- 
ing for a headache. 

For our control, we used a masking 
technique described by Ron Gery in an 
MSDN article “TRANSBLT Demonstrates 
Bitmaps with Transparency,” 2/15/96 ID: 
Q973065) that uses a simple masking tech- 
nique, making it trivial to use arbitrarily 
complex transparent regions. The Gery al- 
gorithm assumes that you have a bitmap 
with a single color that defines the trans- 
parent area. Given that, the drawing por- 
tion of your code needs to execute the 
following steps: 


1. Create a monochrome bitmap of the 
same size as the bitmap you are going 
to draw. 

2. Set all the transparent pixels in the 
monochrome bitmap to 1, and all the 
opaque pixels to 0. 
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(continued from page 72) 

3. XOR the screen region with the bits in 

your image bitmap. 

4. AND the screen with your monochrome 
bitmap. This leaves all the transparent 
areas unchanged, and sets the opaque 
areas to black. 

. XOR the screen region with the image 
bits again. This sets all the transparent 
areas back to their original color, since 
the two XOR operations cancel one an- 
other. The opaque areas now contain 
the desired image bits, since XORing 
with solid black is the same as simply 
setting the bits. 


WN 


This drawing algorithm is simple to im- 
plement using standard Windows raster 
operations. 


Background Checks 

While this drawing scheme works with 
containers that comply to OCX 96 rec- 
ommendations, it does make an impor- 
tant assumption: The XOR/Mask sequence 
preserves the background behind trans- 
parent areas, but only if the background 
has already been drawn. 

To programs like Internet Explorer 3.0 
(IE3), the ActiveX control is a child win- 
dow responsible for drawing its entire rect- 
angular area. When IE3 is drawing the 
background for a web page, it excludes 
the areas occupied by child windows, al- 
lowing them to draw their own back- 
ground. If you’ve set up a background 
GIF, IE3 won't bother to draw it in any 
areas covered by your control, making 
any attempt at transparent drawing 
doomed from the start. 

Microsoft documented a way around 
this problem in Knowledgebase article 
Q165073, which provides a code snippet 
you can drop into a control’s WM_ERASE- 
BKGND handler. Listing One (listings be- 
gin on page 105) is MFC code similar to 
that presented in the Knowledgebase ar- 





Figure 1: Transparent GIFs on the Web. 


Figure 2: A windows drawing region. 
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Figure 4: Adding 


ticle. This code sends a WM_ERASE- 
BKGND message to Internet Explorer, 
along with the device context for the Ac- 
tiveX control. This convinces Internet Ex- 
plorer to draw the background behind the 
control as if it didn’t belong to a child win- 
dow. This code does not cause any trou- 
ble for more sophisticated containers (such 
as IE4) because the WM_ERASEBKGND 
never gets sent to controls. Newer con- 
tainers that implement the OCX 96 spec- 
ification don’t create child windows for 
each embedded control. Instead, each 
control renders itself directly on the con- 
tainer’s device context. 


Putting it Together with ATL 
We created an ActiveX control called 
TransCtl that displays an eight-bit BMP 
file, and treats all solid white areas 
(RGB(255,255,255)) as transparent. To 
keep things simple, the control doesn’t 
do any palette management, so if you 
use it on 256-color displays, your bitmap 
should only use the 20 system colors. 

We created this control using Visual C++ 
5.0 and the ATL. Programming with ATL is 
somewhat more difficult than programming 
with MFC, but the resulting ActiveX con- 
trols are usually faster, smaller, and can 
eliminate dependencies on corpulent DLLs. 

With command-line compilers, we 
could have just included a single C file 
that held all the source code for this pro- 
ject. Things aren’t so easy with IDEs that 
feature Wizard-based development. The 
instructions for building this project with 
Visual C++ look more like a recipe in a 
cookbook, with an interesting mixture of 
dialog boxes, button presses, menu op- 
tions, and code snippets. 

To build an ATL project using Visual 
C++, select File | New | Project | ATL COM 
AppWizard. The dialog box asks you for 
a Project Name, which we set to Trans- 
parentControl; and Location, which we 
filled in with C:\. We selected the default 
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values on the next dialog, creating a DLL 
without MFC support, and no merging of 
proxy/stub code. See Figure 3. 

At this point, we need to create the ac- 
tual ActiveX control that will display our 
BMP files. To do this, we used Insert | New 
ATL Object, which brought up the ATL 
Object Wizard. We selected Controls | Full 
Control and clicked the Next button, which 
brought up a tabbed properties dialog. In 
the edit box that asks for a short name, 
we entered TransCtl. In the Miscellaneous 
sheet of the properties dialog, we turned 
off the Opaque option. 

The Stock Properties tab lets you spec- 
ify which stock properties your control 
will support. Our control needs the 
ReadyState stock property, which is sup- 
ported by ATL, but is absent from the 
Stock Properties tab. To include it in our 
control’s implementation, you have to add 
the code yourself. 

Clicking the OK button at this point gen- 
erates all the files necessary to create the 
basic ActiveX control. You can build the 
control and insert it into Microsoft’s Ac- 
tiveX Control Test Container (accessible 
from the Tools menu item). But before it 
can do anything interesting, you need to 
add the code that assigns an image file to 
the control, loads the image file into mem- 
ory, and draws it on the screen. 


Dealing with the Image File 

Since TransCtl is to display an image file, 
we need a property that gives the name 
of the file. Adding properties to an ATL 
project is another Wizard-driven process. 
In the ClassView tab of the Workspace 
window (Figure 4), you should have two 
classes defined: CTransCtl, and ITransCtl. 
Right-click on the ITransCtl item in the 
tree, and select Add Property. Choose a 
property type of BSTR, which is the 
canonical string type used in ActiveX con- 
trols. Choose a property name of Image- 
File, and use the default get and put func- 
tions the Wizard suggests. 

While the VC++ Wizards help you by 
adding the properties to the control, they 
don’t actually supply the internal imple- 
mentation of the property. This is done 
manually. First, you add a new member 
variable called m_bstrImageFile, of type 
CComBSTR to class CTransCtl in file Trans- 
Ctl.h. (available electronically; see “Re- 
source Center,” page 3). Next, you initial- 
ize it with an empty string in the CTransCtl 
constructor. Finally, you implement the 
get_ImageFile() and put_ImageFile() func- 
tions in TransCtl.cpp (also available elec- 
tronically). 

The put_ImageFile() function in Trans- 
Ctl.cpp has to do more than just copy a 
string into your member variable. It clears 
our internal bitmap that actually holds the 
image, starts the download process, and 
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informs users that the control is in an in- 
complete state. 


Asynchronous Properties 

Until recently, a control’s property values 
(such as the bits of an image) were stored 
within a file managed by its container. As 
the control is instantiated within the con- 
tainer, its property values are provided by 
the container via COM-based interfaces. 
This process of binding a control’s prop- 
erty values is a synchronous operation. 
The container opens its storage file, lo- 
cates the control’s persistent data, queries 
for the control’s persistence interface (usu- 
ally [PersistStreamInit or [PersistProperty- 
Bag), then calls [Persist*.::Load, whereby 
the control sets its property values. 

This works fine for most control prop- 
erties (such as background color and fonts) 
because the data is small. However, large 
property values (like the bits of an image) 
in low-bandwidth environments are a 
problem. 

In early 1996, Microsoft developed a COM- 
based technology called “Asynchronous 
Moniker” to solve the synchronous-binding 
problem. Microsoft also provided an im- 
plementation of the specification called an 
“URL Moniker.” 

Monikers are used to name specific in- 
stances of COM classes. A moniker is a 
COM object that implements the JMoniker 
interface and encapsulates the details of 
instantiating a particular class instance. 
The moniker obscures from the client the 
process of locating, instantiating, and ini- 
tializing a specific COM class. In other 
words, clients work through the standard 
[Moniker interface or the MkParseDis- 
playNameEx API and can ignore class- 
specific details. 

Monikers are typically used to bind to 
a specific instance of a COM object. They 
can also be used to bind to a remote stor- 
age or stream. Microsoft’s URL moniker 
implementation lets a client application 
asynchronously bind to a web resource 
specified by a URL. The client application, 
by implementing the /BindStatusCallback 
interface, treats the resource as a stream 
of bytes (via an [Stream pointer). This 
means a client can download a remote file 
asynchronously by specifying only its URL, 
which is exactly what we need for our 
ImageFile property implementation. 

The JmageFile property then becomes 
a string that holds just the URL for our 
bitmap file. The persistent data for our /m- 
ageFile property is no longer the bits of 
the image, but an embedded reference to 
them. To see how this looks in a web 
page, examine the JmageFile property in 
Listing Two. 

Loading the image proceeds like this: 
As the control is instantiated by the con- 
tainer, it passes the image’s URL as part 
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of the synchronous property binding pro- 
cess. Once we have the URL, we initiate 
the asynchronous download process. The 
download occurs in a background 
thread, and when it is complete, we ren- 
der the transparent control using our im- 
age data. 

If you look closely at TransCtl.cpp, you'll 
notice that instead of using ATL’s CBind- 
StatusCallback class, we developed and 
used a derived class named COurBind- 
StatusCallback. The CBindStatusCallback 
implementation provided with ATL 2.1 
isn’t quite ready for prime time. We 
tweaked a few of its methods to get ev- 
erything working properly. You can ex- 
amine the minor changes by download- 
ing the code. The download process 


Over 30 protocols: 
FTP, SMTP, POP, 





begins with a call to COurBindStatus- 
Callback::Download. Listing Three pre- 
sents the Download() method. 
COurBindStatusCallback is itself an ac- 
tual COM object. Whenever we need to 
download a URL-based file, we just call 
the static Download() method. As you can 
see from Listing Three, Download() cre- 
ates an instance of itself and then starts 
the download process. The key point is 
that we pass in our control’s pointer and 
the address of a callback method. As the 
download proceeds, we will be notified 
through our OnData() callback method. 
The implementation of our callback 
method is shown in TransCtl.h. As data 
arrives, the OnData() method is called 
with a buffer containing the remote data 
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and a flag indicating if this is the first, in- 
termediate, or last data notification. We 
use the flag value to manage our down- 
load buffer. When the download is fin- 
ished, we set the control’s ReadyState 
property to complete. The rest of the code 
is general buffer management. 


Are You Ready? 

Before the addition of asynchronous prop- 
erties, a control was ready as soon as it 
was instantiated and initialized by its con- 
tainer. Now, for those controls with asyn- 
chronous properties, their internal state 
may not allow immediate use after load- 
ing. Microsoft has added a standard con- 
trol property called ReadyState. Controls 
that implement asynchronous properties 
use ReadyState to communicate readiness 
to their users. Listing Four shows the de- 
fined values for the ready state property. 

Most of the states are self explanatory. 
The difference between the INTERACTIVE 
and COMPLETE states is up to the con- 
trol implementor. Those controls that pro- 
vided interaction with users (ike mouse 
clicks) should move to the interactive state 
as soon as possible, even if asynchronous 
downloading is still in progress. 

For example, you might develop a but- 
ton control that displays a bitmap. The 
control should move to the interactive state 
and let its click event fire even as the 
bitmap is downloading. Once the down- 
load is finished and the bitmap is ren- 
dered, the control moves into the com- 
plete state. 

Since we don’t really provide any in- 
teractive behavior for our transparent con- 
trol, we stay in the loaded state until our 
bitmap has finished downloading. By ex- 
amining the OnDraw code in TransCtl.cpp 
and Listing Five, you can see that we de- 
fer rendering of the control until the 
bitmap download is complete. 

There is also a standard-control event 
called ReadyStateChange that you can use 
to directly notify users that the ReadyState 
property has changed. For simplicity, 
we've left out this detail. Adding support 
for the ReadyState property to our con- 
trol is easy. Add a data member of type 
long with the name of m_nReadyState, 
and update the control’s IDL file with the 
get/put methods; see TransparentCon- 
trol.idl (available electronically). Your con- 
trol must also derive from CStockProplmpl 
instead of [DispatchImpl, and you need 
to add the property to your control’s prop- 


erty map. 
We’ve got DIBs 


URL Monikers treat URL resources as a 
stream of bytes. It is up to the client ap- 
plication to add meaning to the returned 
stream. In our example, the stream con- 
tains a Windows DIB or bitmap. A DIB 
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object is different than a GIF file in that it 
doesn’t support progressive rendering, so 
our implementation requires the complete 
bitmap before we can draw the transpar- 
ent image. 

In most cases, working with DIBs in- 
volves reading the bitmap structure in from 
a local file and working with a handle to 
the DIB. There’s even an API call (Load- 
Image) that makes this easy. However, in 
our case, we have to manage a DIB struc- 
ture in memory that is assembled asyn- 
chronously. Once the structure contains 


Once you have 
a drawing 
region defined, 
transparency 
IS easy 


all of the necessary information, we need 
to realize the DIB within a Windows de- 
vice context. To encapsulate this behav- 
ior, we developed a DIB management class 
called CDib, which is loosely based on 
the WebImage example in the Win32 SDK. 
(The header file dib.h is available elec- 
tronically.) 

As data is downloaded, it is stored in a 
buffer in our control’s implementation 
class. Once the bitmap data is download- 
ed, the control’s ReadyState transitions to 
complete. This lets code from OnDraw() 
(Listing Six) execute. The buffer informa- 
tion is passed to the DIB class, a device 
context is created, a DIB section is creat- 
ed, and the actual bits of the image are 
passed to the DIB section. 


Displaying the CDib Object 

Once the image has been loaded into 
memory, all we have to worry about is 
displaying it. Compatibility with Internet 
Explorer 3.0 and older containers means 
we need to add a handler for WM_ERASE- 
BKGND, and use it to spoof the contain- 
er into drawing the background for our 
control. TransCtl.h presents the code 
added to the CTransCil definition to pro- 
cess this message. First, we add an entry 
to the message map for the control, then 
we add the message handler itself. Es- 


sentially, this handler passes the WM_ 
ERASEBKGND message up to the parent 
window, along with a copy of the con- 
trol’s device context. (The ATL Object Wiz- 
ard doesn’t help you add message han- 
dlers; this is a manual process.) 

The actual drawing of the image is 
done in the OnDraw() member function 
of the CTransCtl class. OnDraw() is called 
by the ATL framework, passing in a struc- 
ture that contains various bits of infor- 
mation needed to paint information on 
the screen. Our implementation isn’t much 
more complicated than it would be if we 
were simply displaying the BMP file. We 
have to add the code to create the 
monochrome bit mask, then perform the 
XOR/AND/XOR drawing methods previ- 
ously discussed. 

Listing Seven creates a monochrome bit 
mask in a compatible device context, re- 
lying on a Windows-specific characteris- 
tic of monochrome bitmaps. When copy- 
ing from a color bitmap to a monochrome 
bitmap, any color pixel that’s identical to 
the color bitmap’s background color will 
get set to 1 in the monochrome bitmap; 
all other bits will get set to 0. With this 
knowledge in hand, you can see how the 
code will create the bitmap needed by the 
masking algorithm. 

Once the mask has been created, we 
can do the actual drawing using three de- 
vice contexts: 


e di.bdcDraw contains the screen device 
context. 

e HDC( m_dib) is the device context con- 
taining the color bitmap for our image file. 

e hdcMask is the device context contain- 
ing the monochrome bitmap. 


The three consecutive BitBltC) calls in 
Listing Eight are all that is required to per- 
form the transparent draw. 


Conclusion 

To illustrate the techniques presented 
here, we’ve provided a demo that dis- 
plays transparent BMP files at http:// 
www.widgetware.com/. Since this is an 
ActiveX control, you can use it in both 
desktop and Internet applications— as 
long as your framework supports control 
containment. Fortunately, this includes the 
current releases of nearly all Windows de- 
velopment tools. But if you want to avoid 
the rigors of being an ActiveX user, you 
should still be able to implement the old- 
fashioned approach to reuse by cutting 
and pasting our drawing code directly into 
your application. Sometimes the old ways 
are still the best ways! 


DDJ 
(Listings begin on page 105.) 


Dr. Dobb’s Journal, March 1998 


ON TIME. ON BUDGET. 


ON C#BUILDER. 


C++BUILDER. THIS CHANGES EVERYTHING. 





Borland C+Builder™ delivers the future of C++ Development. This advanced, 
no-compromise, high-performance C++ compiler gets any C++ project finished ahead 
of schedule, running more reliably and performing faster than ever. For the first time, 
you can work within a C++ rapid application development environment that delivers 
high-powered projects, on schedule and on budget. It’s not just a development 





environment—it’s a competitive advantage that goes right to your bottom line. : 
| Cal ViSHal 

With C+Builder, you can build Enterprise-wide applications, Internet applications, ‘4 

NT services, commercial software or low level code without limits. Bottom line, 


Borland C+ Builder is the most advanced C++ development environment ever 





created, providing: 


Borland 
e Hassle-free C++ compilation with incremental link cycles and | G-Builder 
FastCompile to reduce turnaround time [ A 
e Full ANSI C++ for the complete Microsoft Windows 95, Windows NT, and 
Win32 APIs 
e A visual environment that decreases development time without putting 


a=>psneenae-<p | 





barriers between your source code and designers 

e Full support for industry standards like ActiveX, OLE Automation, ODBC, 
DCOM, DirectX, MAPI, Unicode, WinSock, ISAPI, and NSAPI 

e A reusable, open, and extensible class library 


® 
Borland is committed to investing in the future of your C++ development. It’s Borland 
no-sacrifices C++. You owe it to yourself to try C+Builder. Download today what other 
tools can only promise for tomorrow at www.borland.com/beppbuilder. To order, www.borland.com 
call Borland at 1-800-336-6464, offer code 52088, or visit us at www.borland.com 
AD LINK 67 
Copyright © 1998 Borland International, Inc. All rights reserved. All Borland product names are trademarks of Borland 


International, Inc. BI 5594 
a mm ml I I IES EE EDL LE I TIE ELI 








PROGRAMMER’S TOOLCHEST 





erformance improvements come in 

many different flavors. Changes in the 

algorithm usually bring the greatest 

gains: The obvious example of this is 
the replacement of bubble sort by quicksort 
or heap sort, reducing time complexity from 
O(n?) to O(n log n). The next level of im- 
provements usually comes in the actual im- 
plementation, when you address issues such 
as automatic versus heap-based memory, 
working set and virtual memory behavior, 
recalculating versus storing results, the struc- 
ture of loops and branches, and so on. This 
level of improvements largely determines 
the constants in front of the O(m log n) 
bounds. The third level of improvements in- 
clude processor-specific optimizations that 
affect caching behavior, internal parallelism, 
branch predictions, and the like. These may 
call for rearrangement of loops or use spe- 
cialized instructions (MMX intrinsics, for ex- 
ample) or library primitives. 

Clearly, what you need are different 
kinds of information for the different stages 
in the optimization process. At the algo- 
rithm level, high-level overviews of 
caller/callee relationships and intensity (in 
the form of annotated call graphs) are in- 
valuable. At the source-code level, you 
want function-by-function or line-by-line 
timings and counts. Finally, at the proces- 
sor level, you need instruction breakdowns 
annotated with the relevant processor be- 
havior. Traditionally, profilers have ad- 
dressed the middle (source code) stage, 
although new-generation tools for the oth- 
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er two levels are beginning to appear. 

In this article, I examine profiling tools 
that target Win32 C/C++ development. I 
used all of the tools for several weeks as 
part of my development process, and have 
selected a number of test scenarios (avail- 
able electronically; see “Resource Center,” 
page 3) that represent a broad range of ap- 
plications. Some of the tools support other 
languages or platforms as well (see the text 
box entitled “Visual Basic 5 Profiling”). Table 
1 presents an overview of the features each 
tool offers. The tools I used include: 


e Intel VTune 2.5. 

e Microsoft Visual C++ 5.0 (profiling tools 
only). 

e Rational Visual Quantify 4.0. 

e TracePoint HiProf 2.0. 

e Watcom C++ 11.0 (profiling tools only). 

e Win32 SDK (profiling tools only). 


Editor’s Note: As we go to press, Trace- 
Point’s web site Chttp://www.tracepoint 
.com) reports that the company has decid- 
ed to discontinue business operations effec- 
tive immediately. TracePoint was originally 
formed in 1995 from within DEC’s Western 
Research Lab, and became an independent 
company in 1997. Perhaps another compa- 
ny will pick up either TracePoint’s technolo- 
gy and/or the HiProf and Visual Coverage 
pieces. We'll keep you posted. 


Intel VTune 2.5 

Intel makes microprocessors, and its 
VTune profiler reflects that fact. The ba- 
sic profiler uses sampling to obtain mea- 
surements of the program under test, but 
also includes a variety of static and dy- 
namic code-analysis tools that help you 


make the most of the Intel processors at, 
shall I say, a painful level of detail. If you 
are interested in pairing issues, instruction 
penalties, and processor cache misses, 
you're in for a treat. 

The operation of the program is straight- 
forward: You specify which program to pro- 
file, what options to use for the profiler and 
program, and off you go. VTune executes 
the program, collects samples, and (when 
its done) displays the Modules Report — 
the first of a number of bar charts showing 
activity in your program and the rest of the 
system. From there, you drill down into ar- 
eas of interest to obtain other graphics, or 
annotated source-code listings. 

As an alternative to sampling, the VTune 
Code Analyzer performs a static-code ana- 
lysis of your program and provides infor- 
mation about the expected performance 
and low-level behavior of the various In- 
tel processors. The same information can 
be obtained through Dynamic Assembly 
Analysis, which analyzes small sections of 
your program in great detail by actually 
running the entire program and simulat- 
ing the performance of the area of inter- 
est instruction-by- instruction (as opposed 
to using the sampling method applied else- 
where). All methods can show your source 
code (where available) interspersed with 
assembly code, and annotated with re- 
marks about processor performance is- 
sues for the Intel processors. 

To help you translate this information 
to the source-code level, the C and For- 
tran Code Coaches point out source-level 
improvements. They require preprocessed 
source code to do so, and accept a make- 
file, command line, or ready-made pre- 
processed source file as input. If you select 
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(continued from page 78) 

an area in this file (normally a function or 
possibly a nested loop) and invoke the 
Code Coach, you get specific advice for 
the optimization opportunities that the 
Code Coach recognizes, most of which 
have to do with loops and branches. 

In addition to the actual profiler, the 
VTune package also contains the Perfor- 
mance Toolset with C/C++ and Fortran 
compilers that can be plugged into the 
Microsoft Developer Studio environment, 
several numerics and signal-processing li- 
braries, and a wealth of reference infor- 
mation about the Intel processors. The an- 
noying thing is, however, that the access 
program is a Winl6 application that 
doesn’t recognize long filenames and con- 
sequently couldn’t start Acrobat Reader lo- 
cated in “C:\Program Files\Acrobat3.” 

So how useful was VTune during my 
development? Given that I don’t develop 
high-performance numerical codes or de- 
sign code generators for a compiler, the 
level of detail offered by VTune was well 
beyond my needs. As a C/C++ pro- 
grammer, I don’t have very fine control 
over the eventual instructions that are fed 
to the CPU, and after my initial amaze- 
ment over all these processor intricacies, 
I had little practical use for them. Things 
might have been different if the Code 
Coach would have worked properly, but 
try as I might, I never got beyond 
VTune’s message that it had encountered 
an error while parsing my (preprocessed) 
source-code files. 

The fact that VTune uses sampling as 
its data collection method means that ex- 
ecution speed is excellent, but that the 
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granularity (which is modifiable) is not al- 
ways sufficient to capture all required in- 
formation. In my constrained optimization 
test case for example, VTune completely 
missed the index operator that caused the 
initial performance hit— presumably be- 
cause the operator itself didn’t take long 
to execute per call. 

Then there are a few other matters that 
hamper effective use of the profiler. First, 


Instruction counting 
is used as the most 
accurate method in 

the best profilers 





it is inconvenient that the profiler sam- 
ples for a predefined amount of time. It 
does not stop sampling when your pro- 
gram terminates; you'll have to interrupt 
the sampling session manually in that 
case. Conversely, if your program runs 
longer, you'll have to adjust the length of 
the sampling session and try again. Sec- 
ond, there was no good way to restrict 
profiling to specific areas— in fact, the 
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samples would include all system activ- 
ities, relevant or not. Intel advertises this 
as a feature, but I’m inclined to see it oth- 
erwise. Third, the bars in the graphics 
were usually awfully thin, say a pixel or 
so. Since they are the main means of nav- 
igating through the profile data, I had a 
difficult time (even with a 20-inch mon- 
itor) pointing with the mouse cursor and 
selecting the correct one each time. That 
each drill down action brings up a new 
top-level window with yet more bars 
doesn’t help either. I tended to quickly 
lose track of all the windows and bars. 

To sum up, Intel’s VTune excels with 
processor-level performance hints, but I 
found it less useful for general-purpose 
work. It is harder to use than several of 
the others, the sampling makes for a fair- 
ly coarse granularity, and the informa- 
tion provided is too detailed for com- 
mon usage. However, static and dynamic 
analysis give you insight into processor 
behavior, and the electronic documen- 
tation on low-level optimizations and In- 
tel processors is valuable even if you 
don’t need to wring out every cycle of 
performance. 


Microsoft Visual C++ 5.0 Profiler 

Inside the Microsoft Visual C++ 5.0 (and 
earlier) box, you'll find a profiler. This 
profiler can be used to obtain line- or 
function-level timings and counts, and as 
a simple coverage tool at the line or func- 
tion level. The profiler tools consist of 
three console applications: 


e One that modifies the executable or DLL 
under test by thunking function calls 


C, C++ 


Sampling, Timing, 


Sampling 
API Counting 
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DIF, Tab- 
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Table 1: Profiler overview. ++= ae (in the case ar run-time overhead this means least auehbeao) += good, 0 =reasonable, 
—=marginal, — —=worst. *Note: Requires clipboard copy and paste. 
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(continued from page SO) 
and inserting breakpoints to divert the 
flow of control to the recording part of 
the profiler. 

e Another that is the actual recorder. 

e A third that creates output lists in a va- 
riety of formats. 


Normally, a batch file controls the oper- 
ation of the profiler tools and the Visual/ 
Developer Studio contains a command 
that runs this batch file for you with the 
correct executable name filled in. In that 
case, the output listing appears in one of 
the tabs of the Output Window. You 
should be aware that this only works for 
GUI applications; to profile console ap- 
plications, you revert to the command line 
and manually start the correct batch file. 

The profiling tools are fairly versatile, 
allowing you to fine-tune both the instru- 
mentation process (function level or line 
level, timings, counts, or coverage), and the 
recording process (determining start and 
stop points of the profile — including or ex- 
cluding specific functions or modules). It 
also lets you specify how the output listing 
should appear (sorted according to some 
criterion, or in tab-delimited format for 
use by Microsoft Excel and other tools). You 
can also merge data from different runs to 
obtain an averaged effect. However, the 
whole process is purely command line and 
batch-file-based (with some help from 
TOOLS.IND, and is definitely not compa- 
rable to the GUI-based competition. More- 
over, if you want graphical output, you'll 
have to use Excel or another tool. With- 
out them, you are looking at (sorted) lists 
of function and line timings. 

In actual use, these tools aren’t too bad. 
In fact, they are the sort of profiling tools 
that have been around for years on most 
platforms. If you are prepared to spend 
some time learning command-line options, 
understanding the profiling process, and 
working your way through the output list- 


ings, they will get you most of the way. 
The standard cases are all handled by a 
set of straightforward batch files. The 
things I missed most (besides ease of use) 
were the ability to establish caller/callee 
relationships and an easy means to an- 
notate source code with the profiling in- 
formation. In addition, the profiling over- 
head was noticeably larger than with other 
instrumentation-based profilers (as a 


My hunch before 
profiling was that 
the file I/O would be 
the problem 





group, they are much slower than sam- 
pling profilers are anyway). 

In summary, the profiling tools that come 
with the Microsoft C++ compiler are def- 
initely useful, but not as easy to use or as 
complete in their analysis options as the 
best of the flock. On the other hand, they 
are free once you have the compiler. 


Rational Visual Quantify 4.0 

Building on the same instrumentation tech- 
niques used in Purify/NT, Rational has in- 
troduced a profiler for C/C++, Java, and 
Visual Basic 5 (I only examined C/C++ 
programs). The tool comes with its own 


environment, from which you load the 
program to be profiled. Visual Quantify 
instruments the program and all the DLLs 
that it uses (saving the instrumented ver- 
sions under a different name), then runs 
it to collect profiling data. When finished, 
Visual Quantify displays the call graph 
(with the critical path highlighted), func- 
tion list, and session summary windows 
for the run. From here, you can access 
further information in the form of func- 
tion details (showing callers and descen- 
dants of each function) and annotated 
source code. 

There are many ways to customize Vi- 
sual Quantify’s mode of operation. You 
can choose between instruction counting 
(at the line or the function level) or func- 
tion timing as the measurement method. 
Certain Windows system modules, how- 
ever, are always timed rather than in- 
struction counted. Next, you specify the 
options to the program being tested (al- 
though output redirection is not support- 
ed). Finally, when the results are dis- 
played, the Filter Manager lets you hide 
or delete module or function data from 
the views. For enhanced control, a small 
API is defined that lets your program take 
control over its own profiling— starting 
and stopping data collection, clearing the 
buffers, and so on. Obviously, this requires 
modifications to your source code and re- 
compilation, which is not necessary if you 
stick to the GUI environment of Visual 
Quantify proper. 

Visual Quantify also works with mul- 
tiple profiling runs in a given project. 
With a few commands, data from dif- 
ferent runs can be merged to obtain an 
average or differed to see changes in per- 
formance. The latter facility is particu- 
larly helpful and uses the color coding 
in the usual call graph and function 
views to indicate where performance has 
improved or deteriorated. If you run a 
program with multiple threads, the profile 


Visual Basic 5 Profiling 


Point’s HiProf and Rational’s Visu- 
al Quantify incorporate support for 
Visual Basic 5 applications. They both 
rely on the fact that Visual Basic 5 is 


T: most-recent versions of Trace- 


able to create native-code executables 
with optional debug information in the 
same PDB format that Microsoft Visu- 
al C++ uses. Therefore, profiling na- 
tively compiled Visual Basic applica- 
tions is remarkably similar to profiling 
C/C++ applications: You create the ex- 
ecutable with debug information, start 
the profiler, let it instrument the appli- 
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cation, then run the application under 
control of the profiler. 

I have performed only limited test- 
ing of Visual Basic application profil- 
ing, but even so, I’ve found differences 
between HiProf and Visual Quantify be- 
yond those uncovered earlier. Overall, 
HiProf proved better adapted to Visu- 
al Basic: It showed both the names of 
the user-defined functions and of the 
__vbaXxx run-time support functions, 
while Visual Quantify denoted the lat- 
ter support functions as ever so many 
UnnamedTimedProcedures at different 





addresses. Furthermore, although both 
profilers caused a noticeable slowdown 
in the program under test (measured 
by wall-clock time), the slowdown was 
greatest with Visual Quantify. In the 
most extreme case, a test routine that 
took 7.2s to execute stand-alone, took 
75.88 with HiProf, and 305.7s with Vi- 
sual Quantify. It is important to note 
that I used Beta versions of both pro- 
filers, so things might improve in the 
final release versions. 


—Rv.d W. 
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shows the combined timings of all 
threads. By selecting a thread in the Call 
Graph and focusing on its subtree, the 
profile reduces to just that thread. 

In day-to-day development, Visual 
Quantify is a pleasure to work with. Its 
features are well thought out, its UI is 
intuitive and helpful, and the various 
views help to analyze the profiling data 
in several ways— from the high-level 
overviews and call patterns to detailed 
breakdowns per function and source- 
code line. With the information present- 
ed as is, you are likely to emerge with 
a better understanding of your program’s 
behavior than you thought possible. 

However, a few quirks arose during 
testing. For large profiles, Visual Quan- 
tify requires prodigious amounts of vir- 
tual memory (the README file advises 
to reserve 200 MB!). Furthermore, C++ 
filenames that had a bool parameter were 
not unmangled. Surprisingly Gin view of 
all the information available), the anno- 
tated source-code view does not show 
line counts, only line timings. Finally, Vi- 
sual Quantify is limited to Windows NT 
and Microsoft compilers. 

Nevertheless, Visual Quantify is an ex- 
cellent and professional profiler, accom- 
panied by good documentation. 


TracePoint HiProf 2.0 
The introduction of TracePoint’s HiProf 
1.0 profiler broke new ground for Win32 


profiling tools and HiProf 2.0 has sever- 
al improvements over its predecessor, in- 
cluding support for Visual Basic 5. The 
product has its own GUI-based work- 
bench from which applications are load- 
ed, instrumented, and run. The instru- 
mentation process prepares your 
executable and its modules for the data- 
collection run, but skips any modules it 
considers to be “system modules.” In- 
stead of instrumenting these modules, 
HiProf uses Call Site instrumentation — 
basically adding timing code to the callers 
of noninstrumented module functions. 
This cleverly sidesteps problems that 
might arise from modifying system mod- 
ules, but prevents data collection on func- 
tions only used inside those modules. Af- 
ter instrumentation, the program is run 
under HiProf’s control. The results are 
displayed in a variety of formats— the 
Function View (a list of functions), Hier- 
archical View (a pie-chart breakdown of 
function callers and descendants), Criti- 
cal Edge list, and Critical Chain. Further 
views include the Source View and a nav- 
igator view with tabs for profiles and 
modules. Wherever sensible, views are 
linked so that navigation in one view is 
tracked by the other views. 

HiProf’s operation is subject to a num- 
ber of settings, the most important being 
the actual measurement method: instruc- 
tion counting or function timing, with the 
proviso that calls to functions in system 
modules are always timed. Run-time op- 


tions determine the command-line argu- 
ments to the program under test (com- 
mand-line redirection is supported) and 
the use of HiProf’s console, which is a 
small control unit to pause and resume 
the profiling process, and to store snap- 
shots or clear the profiling data. For fin- 
er control, HiProf offers tracepoints, 
which are breakpoints in your program 
that cause HiProf to execute some action, 
such as starting or stopping data collec- 
tion, or storing a snapshot. Tracepoints 
can only be set on entries or exits of func- 
tions, but they are ideal for concentrat- 
ing on specific parts of your program 
without changing the source code. 

A given project may contain many 
snapshots, from different runs or differ- 
ent stages within a run. However, to com- 
pare two snapshots, you'll either have to 
start a second instance of HiProf and ar- 
range the views side by side, or use a 
command-line utility to merge or diff the 
data from different snapshots. This is 
probably the weakest point in an other- 
wise excellent product. For multithread- 
ed programs each thread is shown sep- 
arately and merging must be done with 
the command-line utility. 

In actual use, HiProf 2.0 pairs with Vi- 
sual Quantify in features and ease of use. 
With the introduction of new views in Ver- 
sion 2.0, the data analysis views are on par 
with Visual Quantify. They give excellent 
information from many different perspec- 
tives, and the internal synchronization 


TracePoint Visual Coverage 1.0 


parts and paths of your program 

have been reached during a partic- 
ular run. After instrumenting your pro- 
gram and running it through one or 
more test cases similar to TracePoint’s 
HiProf profiler, data is displayed on var- 
ious Coverage metrics. You can choose 
from function coverage (the percentage 
of functions reached by the test runs), 
line coverage (ditto, for source-code 
lines), code coverage (for CPU instruc- 
tions), edge coverage (for branches), and 
call-pair coverage (for call sites). Visual 
Coverage uses almost the same views as 
HiProf 1.0 does for displaying data, so 
youll find a function list, hierarchical 
view, source view, and, unique to Vi- 
sual Coverage, a function distribution 
view that displays a bar chart with cov- 
erage percentages. Where appropriate, 
views can be switched to display the dif- 
ferent forms of coverage data. Finally, 
since coverage information is usually col- 
lected over several test runs, both the 


Vo Coverage determines which 
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combined and the separate data can be 
viewed. 

Coverage tools are intended as an aid 
during testing. In particular, they help 
to find out which parts of a program 
have been exercised and which have 
not. To some extent, profilers can be 
used for the same purpose, but infor- 
mation like edge and call-pair coverage 
is difficult to obtain without special mea- 
surements. In addition, Visual Coverage 
sorts out all data for you and, as with 
the profilers, a clear presentation of the 
data greatly aids analysis. Visual Cover- 
age does an admirable job here; apart 
from the views mentioned, several high- 
er level selection options filter the data 
before they get to the views. To this end, 
the navigator window contains several 
predefined categories that isolate dead 
or unused functions, functions organized 
per class, or functions organized per 
module. If desired, further categories can 
be defined based on coverage type and 
cut-off percentages for filtering. 


In daily use, I found that using a cov- 
erage tool requires more discipline than 
using a profiler. While a profiler pro- 
vides instant gratification when you see 
the performance improvements, a cov- 
erage tool sits there as an administrator 
and points out that you still haven’t test- 
ed all your code. To be of any real use, 
therefore, you need to be systematic in 
your approach to test cases and you 
must be prepared to spend considerable 
time studying the coverage information 
and building new test cases. For con- 
sole applications, this is not normally a 
problem; for GUI applications, howev- 
er, this means that you have to instru- 
ment the application first, then use Ra- 
tional’s Visual Test or a similar test 
harness to run the instrumented version 
of the application through different test 
scenarios. The resulting coverage data 
can then be viewed with the Visual Cov- 
erage GUI. 


—R.v.d.W. 
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among the views means that you waste 
no time coordinating different sorts of in- 
formation. In combination with its trace- 
points, HiProf lets you tailor both data 
collection and data presentation without 
having to touch your source code. In fact, 
HiProf made profiling and optimizing an 
application almost addictive— a far cry 
from the laborious process that profiling 
used to be. 


Watcom C++ 11.0 Profiler 

Similar to Microsoft Visual C++ 5.0, the 
Watcom C/C++ compiler comes with its 
own profiler. This one is based on sam- 
pling and is packaged as two separate 
programs— one to collect the samples, 
the other to present results. In the Wat- 
com IDE, the Sample command collects 
the data, and the Profile command lists 
the results as a bar chart showing im- 
ages (sample run snapshots) and mod- 
ules. Drilling down leads you to indi- 
vidual functions and finally to source 
code annotated with bar graphs that in- 
dicate the percentage of time spent in 
that function. If you want, you can ex- 
port the sample data to DIF or comma- 
separated text files. 

The whole process has few frills. Un- 
fortunately, not much information is ob- 
tained either. The sampling process only 
records hits per function, and all you get 
(be it in graphical format or an exported 
data file), are the names and the hits per 
function. Although this does give a coarse 
picture of the program’s behavior, I found 
it insufficient for any sensible optimiza- 
tions. For example, my MkDep test case 
spends most of its time in the Windows 
function ReadFile(), and the Watcom pro- 
file never caused me to suspect the func- 
tion or its callers. Likewise, the index-op- 
erator problem in my constrained 
optimization sample was missed com- 
pletely. I assume that this basic approach 
to profiling stems from Watcom’s desire 
to implement the profiling tools on all plat- 
forms it supports, but I’m not so happy 
with the end result. 

On the whole, you will probably find 
the Watcom profiler insufficient for seri- 
ous profiling needs. All is not lost, how- 
ever. The Win32 SDK included with the 
Watcom compiler package contains some 
profiling tools that are much more use- 
ful, at least when it comes to profiling 
Win32 programs. 


Win32 SDK Profilers 

Often overlooked, Microsoft’s Win32 SDK 
contains a lot of useful programs. For pro- 
filing purposes, I found no less than five 
tools that would give information about 
an application’s performance, and that is 
without counting PVIEW or PERFMON. 
Some of the tools are very specialized, but 
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at least one has features that make it a vi- 
able alternative to the commercial com- 
petition— if you are willing to spend time 
learning it. Several of these tools assume 
specialized compiler or linker options (or 
compatibility), which may not be avail- 
able in all compilers. 


e APIMON (API Monitor) is a stand-alone 
GUI-based program that runs other ap- 
plications, keeping track of the Win32 
API functions they call (complete with 
parameter and return values), how much 
time they spend there, and a variety of 
other things such as heap checking and 
page faults. It is simple to operate and 
gives yet another view on your appli- 
cation’s behavior. However, there is no 
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way to relate the information to any spe- 
cific locations inside your program, and 
the tool does not provide for arguments 
to the program under test, which severe- 
ly limits serious testing of command-line 
applications. 

CAP (Call Attributed Profiler) most 
closely resembles the commercial pro- 
filers. It requires you compile your 
code with a special compiler option 
(/Gh for Microsoft C/C++) that inserts 
_penter() hooks at the start of each 
function. These hooks are resolved by 
linking with the CAP.LIB import library, 
and at run time the associated CAP.DLL 
module will be loaded and used to 
record the time spent per function. With 
the aid of some further programs (most 
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notably CAPVIEW) the captured pro- 
file is then displayed as an annotated 
call tree or as a list of function counts 
and timings (with time per function and 
time spent in descendants separated). 
Colors are used to mark the most crit- 
ical functions. I was really surprised by 
the usefulness of this program. Al- 
though it is not so versatile as the com- 
mand-line profiler that comes with the 
Microsoft compiler, it does provide a 
fairly accurate picture of the most crit- 
ical performance data, and it does so 
in a format that is very usable. 

FIOSAP (File I/O and Synchronization 
Win32 API Profiler) is designed to help 
identify I/O and synchronization bot- 
tlenecks in multithreaded programs, al- 
though you can also use it to monitor 
file I/O activity in single-threaded pro- 
grams. It uses a small helper program 
to patch your executable and reroute 
all KERNEL32 calls to its own FERNEL32 
module, which collects data and for- 
wards the call to the appropriate KER- 
NEL32 function. The data collection 
comprises file I/O functions and oper- 
ations on synchronization primitives 
such as semaphores, events, and mu- 
texes. The information can be used to 
assess the amount of time spent wait- 
ing on the various operations. Howev- 
er, the results are summed over all 
threads in an application, and it is up 
to you to find out what the actual caus- 
es of performance loss in this area are. 
PROFILE (Win32 Sampling Profiler) (not 
to be confused with the Microsoft Vi- 
sual C++ profiler of the same name) op- 
erates essentially the same way as the 
Watcom profiler and runs your program 
while taking periodic samples of the in- 
struction pointer’s location. The result 
is a text file that indicates the number 
of hits per function. While it operates a 
lot faster than CAP, I don’t find the in- 
formation it gathers very useful. 

WST (Working Set Tuner) helps you re- 
duce the working set of your program. 
Similar to CAP, WST requires recompi- 
lation with the insertion of _penter() 
hook functions and resolves these in the 
WST.LIB import library. At run time, the 
WST.DLL module takes frequent snap- 
shots of which functions were called 
during the period of time preceding the 
snapshot. The result shows which func- 
tions are used close together in time, 
and the WSTUNE program applies this 
information to produce a packing list 
for the linker that places temporally near 
functions also physically near, thus re- 
ducing the working set of your appli- 
cation. The whole process is something 
that you want to do when your appli- 
cation is almost ready for shipment, 
because during development the constant 
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addition and removal of functions in- 
validate any WST results rather quick- 
ly. Nevertheless, WST can give your ap- 
plications the final touch when it comes 
to performance. 


Case Studies | 
To examine these profilers, I selected a 
representative sampling of profiling tasks. 
All the applications are written in C or C++ 
and run on Win32 platforms. 


MkDep. This is a console application 
that reads C and C++ source files and gen- 
erates a list of #include dependencies suit- 
able for use in a makefile. On the whole, 
the program is I/O-bound. My hunch be- 
fore profiling was that the file I/O would 
be the problem, and I had spent time op- 
timizing this area with special buffering 
and so on. The benchmark I used through- 
out testing was the dependency list for 
MFC 4.2. In the original version of the pro- 
gram, it took about 5:00 (mins:secs) on my 
system to process all those files. In the ver- 
sion as it currently stands, this has been 
reduced to 0:25. 

The improvements came from reducing 
I/O traffic, but not in the way I expected. 
It turned out that the original version was 
spending 75 percent of its time in the _dc- 
cess() function I used to look for header 
files along the INCLUDE paths. I never did 
suspect that function until I profiled. Still, 
all that function did under Win32 was call 
GetFileAttributes(), and there is little room 
for improvement there. Fortunately, the 
profiles showed that it wasn’t just the time 
spent per call, but also the number of 
times it was called. To cut a long story 
short, I implemented a caching scheme 
(in effect storing the entire dependency 
tree of a MkDep run internally) and sim- 
plified file I/O. As a result, the program 
now processes each file in the entire run 
exactly once, using one ReadFile() call 
per file to do so. This is, of course, opti- 
mal and it accounts for 75 percent of the 
current run time. The remainder is taken 
by _access() calls that fail while search- 
ing the INCLUDE path, and the final I/O 
for the actual dependency lists. There is 
still room for improvement but it can be 
at most 25 percent, which is good to know 
because it sets realistic expectations and 
helps to gauge how much effort should 
go into further optimizations. 

Constrained Optimization. These pro- 
grams tend to do a lot of internal process- 
ing and are therefore mostly CPU-bound. 
My constrained-optimization test program 
is another console application that attempts 
to solve an optimization problem using 
clever search techniques. It uses large 
graph-like data structures that display poor 
locality of reference. To really improve per- 
formance in this sort of NP-hard applica- 
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tions (improvements in the order of 101° or 
more), you need to improve the search 
methods, but I used the profilers to see if 
I could make the operational side of things 
just mildly faster (say, a factor 2 or 3, which 
is peanuts to researchers in this field, but 
still worth some effort). 

Before the profiling sessions, I didn’t 
have a good idea about how time was 
spent in the program, although I suspect- 
ed that memory allocations might play a 
role. To my surprise, the profiling sessions 
revealed that most of the time was spent 
inside a lowly index operator overloading 
in one of the array classes, and in the dy- 
namic_cast<> operator used in several crit- 
ical places. The index operator itself wasn’t 
particularly complicated, but it did bounds 





checking on each call and was called very 
often— well over 43 million times to pro- 
cess just 100,000 nodes in the search tree. 
The dynamic_cast<> operator was used 
for a downcast somewhere in the program 
to obtain application-specific information 
from a generic tree node. It too was called 
often (1.4 million times). 

After removing these bottlenecks, the 
program was about 1.24 times faster than 
the original one, and things became more 
complicated because the cycle eaters were 
more evenly distributed. In the end, I was 
able to speed up the program by nearly 
a factor of two; further improvements 
would have required specialized memo- 
ry allocators or changes to the way infor- 
mation was stored. Without the profiling 
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information, it wouldn’t have occurred to 
me to look at those particular functions — 
even though I was the one who designed 
and implemented both the data structures 
and the searching algorithm. 

Simulation Model Calculations. These 
test programs are part of a Win32 GUI 
program that uses MFC as its application 
framework. My test application is itself a 
business simulation that processes deci- 
sions taken by various “companies” staffed 
with management trainees. The challenge 
was to find the bottlenecks in the actual 
model calculations, since they seemed to 
be slower than necessary. The problem 
here is the interactive nature of the ap- 
plication: How do you make sure that the 
model calculation timings aren’t drowned 
in all the message processing and sur- 
rounding activities of the program? 

As it turned out, the calculations them- 
selves were no problem, but the code con- 
tained calls to logging functions, which 
created an audit trail of the simulation 
model’s decisions. These logging functions 
were the real time eaters, and I could only 
partially remedy that problem. However, 
only a few profilers let me isolate the cal- 
culations from the rest of the program and 
make this analysis obvious. HiProf’s trace- 
points came in very useful here; for Vi- 
sual Quantify I inserted API calls to start 
and stop the profiling at the correct loca- 
tions. All other profilers left me wading 
through long lists of irrelevant informa- 
tion, although with extra effort I could 
have configured the Visual C++ profiler 
to exclude nearly everything except for 
the functions I was interested in. Still, this 
would have meant a fair bit of work, 
which would have to be repeated if I had 
turned to other areas of the program. 

Multithreaded Record Processing. 
My final test was a simple multithreaded 
program borrowed from the HiProf ex- 
amples to see how the profilers dealt with 
multi-threaded programs. I did not attempt 
to optimize anything here; I was just in- 
terested in the information that would be 
obtained from this analysis. 

I was disappointed. With the exception 
of HiProf and Visual Quantify, all profil- 
ers just lump together their timing or sam- 
pling information and never show how 
the time was distributed across threads. 
Visual Quantify has the most convenient 
way of separating and combining per- 
thread information through its Call Graph. 
HiProf shows all threads separately, but 
due to its somewhat involved merging pro- 
cedure, the combined information is not 
very easy to view. 


Profiling Methods 

The profilers I examined use one or more 
of the following methods to obtain timing 
information from the program being tested. 
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Sampling. With sampling, the program 
is interrupted frequently and a note is 
made of the location of its instruction 
pointer. With the aid of a map file or de- 
bug information, this location is translat- 
ed back to a function name or source code 
position. Normally, all samples within a 
single function are lumped together. The 
advantages of this method are its simplic- 
ity and the fact that it has comparatively 
little impact on the execution speed of the 
program under test; the primary drawback 
is the coarseness of the information. Even 
at the best of times the samples are only 
an approximation of where the time is 
spent in a program, and for a variety of 
reasons (granularity, resonance) the col- 
lected samples can be downright mis- 
leading. 

Event-based sampling. Supported 
only by Pentium Pro processors and lat- 
er, event-based sampling uses internal 
counters in these processors to collect in- 
formation about performance-related 
events such as cache misses. VTune can 
use this sampling mode as an alternative 
to standard sampling, and will collect both 
the event data and the regular instruction 
pointer samples by interrupting the pro- 
cessor at frequent intervals. 

Timing. A better method is to actual- 
ly use a timer to measure the time spent 
in a function. This requires that the pro- 
filer be notified of function entries and 
exits (accomplished through instrumen- 
tation or the insertion of hook functions), 
and assumes an accurate timer. The lat- 
ter may be provided by the operating 
system or by the processor. The Pentium 
and later CPUs contain a high-resolution 
counter that is used by several profilers 
for this purpose. The profile is basically 
a record of wall clock time, which means 
that it also takes into account things such 
as cache misses and background activi- 
ty in the rest of the system. Depending 
on the purpose, this could be an advan- 
tage or a disadvantage, but in any case 
it is not as repeatable as instruction 
counting is. 

Line counting. Since the overhead of 
timing is usually too large to make it prac- 
tical at the line level, line counting is of- 
ten used to obtain an indication of the 
program’s performance at the source-code 
line level. It requires some form of break- 
points at the line level and introduces a 
large amount of overhead. In addition, un- 
less coupled with instruction counting, it 
may give little information about the ac- 
tual time spent in some section of code. 

Instruction counting. The final 
method is instruction counting. In 
essence, the profiler counts how often a 
particular basic block in the program is 
executed, then multiplies that with the 
number of clock cycles required for the 


instructions in the block. Obviously, this 
requires detailed information about the 
underlying processor and is often de- 
pendent on the exact version of the pro- 
cessor. Furthermore, the actual number 
of clock cycles per instruction may vary 
with the dynamic behavior of the pro- 
gram and its environment (considering 
things such as cache and pairing behav- 
ior, branch prediction, and so on), so the 
profiler must make some assumptions 
here. Usually, it assumes optimal execu- 
tion. Instruction counting does not in- 
clude any ambient effects, which makes 
it more reproducible and in some sense 
“purer,” but it tends to give a somewhat 
optimistic view of the program’s speed. 
Even so, it is used as the most accurate 
method in the best profilers. 

Code analysis. Intel’s VTune provides 
code analysis, a different method that uses 
very detailed information about processor 
behavior to report penalties incurred, pair- 
ing issues, and expected cache behavior 
down to the instruction level. This is not 
profiling in a strict sense, but it does give 
performance information. A companion 
Code Coach will sometimes advise on re- 
arrangements of the source code that re- 
duce the performance penalties. 
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ocessor on which ihis program 
Gai is. 

HHUA 
idProcessorName() 


ProcessorramilyiD=0. 
lean CouidinsiructionAvailabie: 
lean ProcessorisGenuineintel: 


suidDig op = new CCpuidDigim_pMainvWnd)- 


st Processor informnalion 

ist call native method to 
pxecute CPUs CPUID instruction 
(Ge 8 82 


essor-amilyiD = ProcessorFamily(): 
idinstructionAvailable = 
puidinstructionSupporied(). 
essorisGenuineintel = Genuineintel(): 


Bt processor name 
puidinstructionAvailabie) 


Verify ts a recent Intel’ CPU 
(ProcessorisGenuineintel) 


i 4 is lowest family number 
i we could see here. 
i AN processor signature 
i info is available 
switch (ProcessorFamilylD) | 
case 0: 
case 1: 
case 2: 
case 3: 
// Programmer s paranoia- 
His Vile cari get here! 
p. SeiProcNamel ). 
break; 


case 4: 
// Handle the different types 
fi of intel486(tm) processors 
i separately 
p SetintelFamily4 ProcName 0); 
break; 


case 5: 
// Pentium’ processor 
p.SetProcName (ProcessorName 
IPNAM_ Pentium _Processor)): 


ii Gheck for MMX(im) 
// technology support 
if (ProcessorHas lechnology 
(MMX_ TECHNOLOGY)) 
p. AppendProcName 
(MMX_Technology Str: 


// Add in trademark symbols 

p AppendProcName ( 
Trademark_Str): 

p.AppendProcName ( 
Registered _|rademark_Str): 

break: 


case 6: 
// Pentium’ Pro or 
// Pentium’ Il processor 
if (ProcessorHas lechnology 
(MMX_TECHNOLOGY)) 


p.SetProcName ( 
ProcessorName | 


PNAM Pentium _|lProcessor)): 


} 

else 

i Processor from the Stone Age. 
#i CPUID inst not available 


p.SeiProcName (ProcessorName 
[PNAM NO CPUID))- 


ELT LL 

ii 

ji Procedure: scale _array_elements 
if 

/} Function: 

ji Seale the elements of a floating 
fi point array according to the 

li desired scale factor. 

ij 

TELE 

void scale _array_elements (floal scale factor) 


{ 


a ee 
float scale recip; 


// NOTE: Viune showed this method is compiled 


// at execution time by the JIT compiler. 


scale recip = 1.{/ scale factor, 
for (i = 0; i < size: i++) 
f_array[i] = f_arrayli] * scale_recip: 


// Vtune suggested multiplying by the reciprocal 


//_ instead of dividing in the loop - multiply 
oS esce 

// Original Code: 

f= for i =05 I < Size: i++) 

i f_arrayli] = {_arrayji] / scale factor: 


} 


FTL 
if 
/i Procedure: GetProcessorCacheDescniptors 
i 
// Function: 
#i Use CPUID instruction to retrieve 
fi _ raw data on the processor s caches, 
## then extract the data into a simple 
fi affay. 
if 
JHU 
inti] GetProcessorCacheDescriptors 
fint{] Descriptor, int Size) 


int Cpuidinfoll[] 
600 0) 


Ou U 


/i/ Check parameters 
i (Descriptor == nuill ll Size < 1) 
returm null: 


// initialize result array. 
for (i = 0: i < Size: i++) 
Descriptori] = 0: 


// Get raw cache descriptors from 

i native CPUID assembler routine 

p.GeiCPUID CacheData 0: 

DescriptorsDefined = 
GetRawCacheDescripiors (Cpuidin 


// Reset descriptor output counter. 
i= 0 


// Parse the descriptors f available 
if (DescriptorsDefined) 


for (set = 0; sel < 2; sel++) 


for (dword = 0: dword < 4, dword 
{ 
for (abyte = 0: abyte <= 3, ab 
{ 
Betes aie — 
Couidinfo|set]|dword| 
& Oxil, 
Cpouidinio|set]|dword] 
So 6: 
yy (Beceg sis) 8) 


// Early out on short buffer. 
f G >= Size) 
return Descriptor. 
} 


} 
j 


return Descriptor; 


} 


// Return null on error exit. 
return null: 


} 


HHHHHUUUUHHUMAHAHIL 
i 
// Procedure: Fpulype 
if 
// Function: 
// Determine type of floating point 
i uni. 
Hf 
// returms: 
FPU NONE =: if no floating point uni 
FPU 287  : if 287 floating point unit 
FPU 387 _ : if 387 floating point unit 
FPU _487SX_ =: only if unambiguously 
intel 487SX NDP 
FPU 486DX_ DX2 487SxX: if one of | 
intel486 DX2. or 
intel486 SX processors 
with 487 SX 
FPU_ON CHIP: if Genuine Intel proc 
that supports CPUID 
and that reports FPU 
feature flag. 
: if this routine cannot 
recognize the FPU 


MUTUAL 
int 
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riting a device driver in C using 
the Windows NT Device Driver 
Kit (DDK) can be scary. Luckily, 
there are alternatives, such as 
BlueWater Systems’ WinDK and Vireo Soft- 
ware’s Driver::Works. In this article, I'll ex- 
amine these class libraries and develop a 
hardware-device simulator driver to illus- 
trate their use. But first, let’s take a look 
at the inner workings of Windows NT de- 
vice drivers. 





Windows NT Device- 

Driver Backgrounder 

There are two groups of Windows NT de- 
vice drivers — monolithic and layered. A 
monolithic device driver represents one 
piece of hardware (a data-acquisition 
board, for instance), while a layered driv- 
er sits on top of another driver (or be- 
tween two drivers) forming a hierarchy. 
SCSI and network drivers are examples of 
layered drivers. 

Device drivers run in NT’s kernel mode, 
while applications run in user mode. The 
method of communication between an 
application and a device driver is an I/O- 
request packet (RP). An application uses 
CreateFile to receive a handle to a device 
driver (or more precisely, to the device 
object exposed by the device driver). The 
application can then use standard Win32 
API functions, such as ReadFile, Write- 
File, and DeviceloControl, to communi- 
cate with the device. 


Patrick received his B.S. in computer sci- 


ence from the University of Umea, Sweden. 
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Patrick Tennberg 


A device driver exports one function, 
DriverEntry, which is called when the 
driver is first loaded and is responsible 
for claiming hardware resources and ini- 
tializing the hardware. One single hard- 
ware adapter can contain several differ- 
ent devices (a soundboard can also 
contain a joystick port, for instance). 
DriverEntry is responsible for creating 
one or more device objects. Each device 
object can represent a physical device on 
the hardware; a device object can also 
represent a logical device. 

Applications residing in user mode are 
not interested in communicating with the 
driver, but rather, with the device itself. 
DriverEntry, therefore, creates a symbolic 
link for each device object it creates. Driv- 
erEntry also registers dispatch routines. 
Dispatch routines (for a highest-level driv- 
er) live within the context of the calling 
application; this is called “passive level.” 
A dispatch routine corresponds to a driv- 
er request; examples of driver requests 
are ReadFile, WriteFile, and CloseHandle. 
When an application in user mode calls 
ReadFile, the I/O manager assembles the 
request to an IRP, and dispatches it to the 
callback registered with the driver to han- 
dle read requests. 

A dispatch routine is responsible for 
completing the IRP (that is, performing 
the action requested by an application), and 
returning a status code. A read request reads 
data from the device, while a close request 
cleans up and closes it. The dispatch rou- 
tine cannot always complete the IRP im- 
mediately; in some cases, the device may 
need to be started before data can be read 





or written. The device object contains a 
queue. A dispatch function can choose to 
insert the IRP into this queue and return 
STATUS_IO_PENDING. This instructs the 
I/O manager to put the calling application 
on hold until the request can be fulfilled. 
Before inserting the IRP into the queue, 
the dispatch function attaches a cancel 
routine with the IRP. The cancel routine 
is called by the I/O manager if the appli- 
cation or the operation system closes the 
connection before the driver has com- 
pleted all IRPs. 

Another routine that a driver can reg- 
ister is StartJO. Because it operates in an 
arbitrary context, it is limited in what 
DDK APIs it can call and it must use non- 
paged memory. This rule is true for all 
parts of the driver not running at passive 
level. A dispatch routine stores an IRP in 
the device queue by calling loStartPack- 
et. The I/O manager will, at this stage, 
see if the device object is already busy 
processing an IRP. If the device object is 
busy, the IRP is stored in the associated 
queue for later processing. If the device 
object is not busy, the I/O manager calls 
StartlO directly. StartIO is responsible for 
starting the device (that is, making it 
ready for a read or a write). Before do- 
ing anything else, the StartJO routine 
should check if the IRP has been can- 
celled; if so, it should return immediate- 
ly. When the driver is sure the IRP can 
be completed, it should remove the can- 
cel routine attached to the IRP. 

When the device is finished, the driver 
needs to be notified. The physical device 
can notify the driver by generating an in- 
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(continued from page 92) 

terrupt. The interrupt service routine (ISR) 
then acknowledges the interrupt and gen- 
erates a deferred procedure call (DPC), 
which completes the IRP and starts the next 
package by calling JoStartNextPacket. 
The I/O manager then dequeues an IRP 
from the queue and calls StartIO again (if 
the queue is not empty). The StartJO rou- 
tine lets a driver serialize the communi- 
cation with the physical device. An ISR 
should do as little work as possible to 
keep the interrupt latency down. The real 
work should be done by a DPC. 


A Simulator Device Driver 

The company I work for contracted to 
write device drivers for a manufacturer of 
hardware and software for real-time data 
acquisition and analysis in the medical 
field. Each device driver is represented in 
user mode by a number of components 
that plug into the rest of the system. When 
I received new hardware that needed a 
driver, I started by writing a simulator driv- 
er. The simulator driver correctly handled 
all requests from the various components 
in user mode. It also implemented the 
scheme we had selected to transport data 
effectively from kernel mode to user mode. 

When the simulator driver was ready, I 
continued by writing the user-mode com- 
ponent. The testing group could test the 
system with the simulator driver without 
worrying about the actual hardware. This 
scheme also allowed other developers to 
test their components with the simulator 
driver. The big gain is that I don’t need to 
worry about the state of the hardware and 
eventual bugs. While the testing group 
and the other developers used the new 
components and simulator driver, I fo- 
cused on writing the real driver and re- 
solved problems without holding up the 
rest of the team. 

The driver I developed (available elec- 
tronically; see “Resource Center,” page 3) 
simulates a simple data-acquisition board. 
The driver supports three I/O control re- 
quests (configure, start, and stop) and a 
read request. IOCTL_DATADRIVER_SETUP 
takes a structure as a parameter. This 
structure contains the sampling frequency 
and a channel mask. The channel mask 
determines which data channels to sam- 
ple. IOCTL_DATADRIVER_START starts 
the sampling of all channels and 
IOCTL_DATADRIVER_STOP stops the 
sampling. The sampling is done using a 
repetitive timer. A new feature of Win- 
dows NT 4.0, the repetitive timer restricts 
the use of the drivers to this platform. The 
output of the driver is either a square wave 
or data from a file Gf a filename is speci- 
fied in the registry). The transport of data 
between user mode and kernel mode uses 
buffered I/O; this is inefficient, but it 
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doesn’t matter much because the size of 
the data is small and the sampling rate 
low. The buffered I/O means the I/O man- 
ager copies the data from a buffer allo- 
cated in kernel mode to the buffer pro- 
vided by users in the call to ReadFile. | 
have included a test program that works 
with both drivers and prints the data to a 
console window. 


WinDK 2.5 

Bluewater Systems’ WinDK Device Driv- 
er Development Kit Version 2.5 supports 
Windows NT device-driver development 
in C and C++. The toolkit provides Wiz- 
ard technology, Windows Driver Model 
(WDM) support for both Universal Serial 
Bus (USB) and IEEE 1394 (Firewire) de- 
vices, device-driver profiling, and sample 
code (source to the library has to be pur- 
chased separately). System requirements 
for WinDK 2.5 are Windows NT 3.51/4.0 
(Intel or Alpha), Visual C++ 4.x/5.0, and 
the Windows NT or WDM DDK. 

The WinDK class library is delivered on 
CD-ROM, along with manuals that intro- 
duce the NT driver model and hard-to- 
find information such as how to set up 
WinDBG (the debugger delivered with the 
DDK). The reference manual describes all 
classes and member functions, and in- 
cludes documentation on the C library. 
Because the reference manual is too brief 
to be useful, you are often forced to look 
up the underlying function in the DDK to 
get the information you need. The online 
help consists of a copy of the reference 
manual. 

The WinDK Device Driver Wizard is pow- 
erful and easy to use. However, I have some 
minor complaints about the Wizard. You 
cannot specify the types of the registry keys: 
You must manually change them in the 
code. The same is true for IOCTL. Also, 
when defining new registry keys and IOCTL, 
you can only remove the last in the list. The 
Wizard only lets you select between the two 
available dispatch functions (Read and 
Write). The code generated is heavily com- 
mented and sprinkled with “To Do” phras- 
es where you can add your own code. 

The Wizard is able to generate much 
more complicated skeletons than the one 
used for the simulator driver I present 
here. Multiple devices, multiple device 
queues, and DMA and PCI setup are only 
a few of the more-advanced options for 
which the Wizard will generate code. 

The Wizard generates three source files 
and two header files. One source file con- 
tains the DriverEntry and all dispatch func- 
tions. Another contains the device class def- 
inition (derived from CDevice and/or other 
classes, depending on the selection you 
made in the Wizard) and device class body. 
The file defining the body contains only 
the code for the constructor and the de- 


structor. The third source file contains dis- 
patch member functions; these member 
functions are defined in the first header 
file: the device-class header file. The sep- 
aration of constructor and dispatch func- 
tions makes the code easier to navigate. 
The second header file, which contains the 
IOCTL, can be shared between the device 
driver and a user-mode application. 

For DriverEntry, the Wizard generated 
code for setting up the dispatch routines 
and creating a device object. The only 
thing I needed to do was add a registra- 
tion dispatch routine for a Close request. 

The WinDK class CDevice encapsulates 
the functionality of a device object, includ- 
ing functions for queuing and completing 
IRPs. The work of setting up the physical 
device is done in the constructor of the de- 
vice class. My constructor needed to do 
three things. First, ] used a WinDK function 
to create a symbolic link name. The Wiz- 
ard has generated all code necessary to read 
the registry values I specified when defin- 
ing my driver. If the registry key “DataFile” 
exists and points to a valid file, then the 
constructor allocates memory and reads the 
file into memory. The code for reading the 
file was straightforward to write. The library 
contains the class CFilelo, which hides all 
details of creating, reading, and writing files. 
If the DataFile entry is missing, the con- 
structor allocates memory for two data en- 
tries and sets a flag instructing the device 
to output a square wave. 

The next step was to include functional- 
ity for my dispatch functions. A dispatch 
callback in WinDK is an ordinary C func- 
tion. The library provides a macro that takes 
a device object as a parameter and gives 
you a pointer to the corresponding device 
class. You can then use this pointer to call 
member functions or to manually manipu- 
late data contained in your device class. The 
Wizard has generated a dispatch function 
for handling DeviceloControl requests 
(DataDriverDevicelocilDispatch). The Wiz- 
ard has also generated member functions 
(DataDriverDevice) for handling each of the 
IOCTLs our driver supports. The actual 
IOCTL that caused the dispatch is stored 
with the IRP, and the dispatch function 
DataDriverDeviceloctlDispatch contains a 
switch statement that calls the correct mem- 
ber function. 

The member function DataDriverDevice- 
ToctlSetupDevice will check the parameters 
and copy the frequency and the channel 
map from the structure used as a parame- 
ter to the DeviceloControl call in user mode. 
The DataDriverDevice class uses the WinDK 
class CRepetitiveTimer to simulate data sam- 
pling. This class lets you connect a dispatch 
function that will be called repetitively when 
a certain period of time has expired. 
DataDriverDeviceloctlSetupDevice registers 
a callback routine with the timer. 


Dr. Dobb’s Journal, March 1998 


DataDriverDeviceloctiStartDevice con- 
verts the frequency from “Hertz” to “mil- 
lisecond” and starts the timer. Data- 
DriverDeviceloctlStopDevice will stop the 
timer and print a message if you missed 
any samples. This can happen if the test 
program fails to feed the driver with IRPs. 

Processing a ReadFile request is per- 
formed by the function DataDriverDe- 
viceReadDispatch. It does a sanity check 
on the data and, if everything is correct, 
queues the IRP and attaches the default 
WinDK cancel routine to it. In our case, 
the StartIO function does nothing— all 
work is done in the timer callback. 

The timer callback registered with the 
timer object, TimerCallback, first checks 
if there is an IRP available. You can get the 
current IRP by calling the member function 
GetCurrentirp (defined by CDevice). This 
function returns the current IRP, which 
is assigned to a field in the device ob- 
ject by the I/O manager before the sys- 
tem calls StartlO. It then checks if the 
IRP has been canceled. Because we are 
in the process of completing the IRP, we 
also remove the cancel routine. This en- 
tire Operation is performed by the sin- 
gle WinDK function CheckirpCanceled. 
If this function returns True, you know 
the IRP was canceled and returned from 
the TimerCallback. You then call the 
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TimerCallback member function defined 
in the DataDriverDevice class. The mem- 
ber TimerCallback fills in the structure 
passed as parameter in the call to Read- 
File, and completes the IRP. 

The WinDK has a pragmatic design. Blue- 
Water Systems used it internally before re- 
leasing it as a product. The design goal was 
to have a library that imposes no overhead 
and helps the developer in following the NT 
device-driver design rules. Classes are only 
used to encapsulate common functionality 
and are not designed for inheritance that 
would change the behavior of a class. 

Bluewater Systems put in a lot of effort 
to make programmers follow the design 
rules laid down by the NT kernel design- 
ers. For example, all classes override op- 
erators new and delete. This allows the li- 
brary to allocate the correct type of memory 
(paged or nonpaged) for a class. Classes 
that encapsulate functionality that is only 
available on the passive level will always 
allocate paged memory, and vice versa. 

WinDK contains a number of useful 
tools, such as debugger extensions, a pro- 
filing tool, and performance-counter sup- 
port. The profiling tool can be used to pro- 
file any device driver you have the source 
for. The profiling tool comes complete with 
a user-mode application that displays pro- 
filing graphs. The samples are well com- 





mented and follow the NT device- driver 
design rules. All samples supporting hard- 
ware have the same standard as commer- 
cial drivers. The full duplex serial driver, 
for example, can be used instead of the 
one provided with Windows NT (and it is 
up to 20 percent faster). 


Driver::Works 1.2 
Vireo Software’s Driver::Works is a C++ class 
library device-driver development toolkit 
that includes the Driver::Wizard code- 
generation wizard and Driver::Monitor 
(which lets programmers monitor driver ac- 
tivity without a debugger). The package 
supports NT/WDM driver development and 
comes with full source code for both the 
library and sample drivers. Driver::Works 
requires Visual C++ 4.2 or later, as well as 
the NT or Windows 98 DDK. 

Driver::Works is delivered on two 
diskettes, along with a manual that cov- 
ers installation, the Wizard, and the li- 
brary’s object model. The main part of the 
manual is a cookbook of examples. The 
manual doesn’t provide a complete pic- 
ture of the NT driver model. Still, the on- 
line help is excellent and contains most 
of the information needed to use classes 
and members correctly. 

The Driver::Works Wizard lets you 
specify all dispatch functions. You can also 
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(continued from page 95) 

specify the type of registry keys and IOCTL. 
The Wizard generates advanced skeletons 
that use multiple devices, multiple device 
queues, DMA, and PCI hardware. 

For my simulator driver, the Wizard gen- 
erated four source files and a file con- 
taining the IOCTLs. The source file 
DataDriver (and its corresponding head- 
er file) defines the class DataDriver that 
inherits from KDriver. The Wizard gener- 
ates code that searches the registry for de- 
vices and creates a DataDriverDevice class 
for each device found. The Wizard will 
also provide similar code for searching af- 
ter devices on a bus. All this work is done 
by the KConfigurationQuery class. The 
real workhorse is the DataDriverDevice 
class, derived from KDevice. The con- 
structor contains the code that reads from 
the registry and loads a data file— if the 
“DataFile” key exists. All code for reading 
from the registry is created by the Wizard. 
If the key exists, you use the KFile class to 
read the data file from disk. Otherwise, you 
set up the simulator device to output a 
square wave. This code is almost identical 
to the WinDK code, but one difference is 
that the base class creates the symbolic link. 
The constructor for KDevice takes a device 
name as one of its parameters. 

Driver::Works creates all dispatch call- 
backs as member functions. The library 
creates stub functions that redirect the call 
to the correct member function. The 
scheme involves the same magic as MFC 
and OWL use to map window messages 
to member functions. All this behind-the- 
scenes work can look like a waste of pro- 
cessor time, but the overhead is only 
around 300 microseconds on a 100-MHz 
Pentium. The library lets you use ordinary 
C functions as dispatch callbacks, so the 
redirection is more of a convenience (be- 
cause you will usually call member func- 
tions yourself in the dispatch callbacks). 
Driver::Works lets you use member func- 
tions for a wide variety of callbacks such 
as ISR, DPC, controller objects, and so on. 
However, ISR and DPC should be fast, so 
the code generated by the Wizard uses 
ordinary C functions as callbacks by de- 
fault. This technique is used by the call- 
back for the repetitive timer, handled by 
the class KTimedCallback. 

The code for handling Read, Close, Start- 
IO, and DeviceloControl are identical to 
the WinDK version. The only thing to note 
about the Driver::Works version is the cum- 
bersome way of handling canceling of IRPs. 
The Wizard generates all the code neces- 
sary, but WinDK handles this much more 
elegantly by providing library code that 
can be used transparently. I would like to 
see a function similar to CheckIrpCanceled. 
Also, the default cancel function really be- 
longs in the base class. 
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Driver::Works does an excellent job of 
encapsulating the complete DDK (at least 
the kernel part). The design is flexible 
and unobtrusive. The classes are de- 
signed to be extendible, and can be used 
as building blocks for reusable compo- 
nents. They also provide template-based 
classes for handling different types of 
containers. 

Driver::Works is only delivered with one 
tool — Driver::Monitor— that lets you see 
trace outputs from your driver without any 
debugger. However, this tool requires that 
you use the K7race class for output. 


Conclusion 

Both toolkits greatly simplify the devel- 
opment of device drivers. They both sup- 
port Windows Driver Model (WDM) and 
Windows NT driver development. The 
main difference between WinDK and Driv- 
er:: Works is in the design. Vireo has cho- 
sen to design an academic object- 
oriented framework, while BlueWater pro- 
vides you with utility classes and a C-style 
approach. These differences are illustrat- 
ed in Listings One and Two (listings be- 
gin on page 100). Listing One is code for 
claiming and using I/O ports and inter- 
rupts with DPC using WinDK, while List- 
ing Two is code for doing the same thing 


with Driver::Works. C programmers ex- 
perienced with NT driver development 
might prefer Bluewater’s library. C++ de- 
velopers and developers first exposed to 
device drivers, on the other hand, will 
probably prefer Vireo’s Driver::Works. 
Whichever library you choose, com- 
pared to developing your driver in C, you 
will save a tremendous amount of time 
compared to using just the Windows NT 
DDK. The ideal solution for me is to use 
both— Driver::Works for its design, and 
WinDK for its samples and great tools. 
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VerCheck: Discovering 





Component Version 





A simple utility for 
Win32 programmers 





John Graham-Cumming 


oftware version numbers are often 

the bane of a programmer’s life — 

they must be checked for compati- 

bility purposes, incremented for new 
releases, matched against information 
stored in source-control systems, and fre- 
quently changed on the whims of the 
marketing department. 

Most programmers shipping commer- 
cial software for 32-bit Windows are faced 
with the task of installing common DLLs 
and OCXes that form parts of their pro- 
jects. ’'ve worked on software where up 
to 25 different components (none of which 
I created) had to be installed and up- 
eraded after matching version numbers 
with the customer’s machine. 

In addition, Microsoft frequently updates 
these common components as new fea- 
tures are added and revisions are made to 
operating systems. Many of these versions 
appear in OEM Service Releases (OSR) and 
Service Packs for Windows 95/NT, forcing 
us to figure out which set of DLLs and 
OCXes is most recent and compatible. And 
with the introduction of Internet Explorer, 
Microsoft has so accelerated the pace of 
new common control versions that I have 
to check Microsoft’s online Knowledgebase 
on a daily basis. 


John, director of product architecture for 


Optimal Networks, can be contacted at 
jgc@optimal.com. 
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Numbers 


To help us, Microsoft creates a REDIST 
directory (on the Visual C++ 5.0 disk, the 
full path is \DEVSTUDIO\VC\REDIST) 
containing copies of the components that 
may need to be redistributed and that con- 
stitute a compatible set. Programmers us- 
ing Visual Basic can find a similar collec- 
tion of DLLs and OCXes on the Visual Basic 


CD, and anyone using the Data Access Ob- 
jects (DAO) can also find information about 
the latest versions in the Knowledgebase. 

All this works well until the software is 
released— then you're suddenly faced 
with customers who have varying levels 
of service packs and OSRs, not to men- 
tion versions of Office 97 or Internet Ex- 
plorer that have updates to some of the 
DLLs and OCXes on which their program 
depends. More than once, I’ve seen prob- 
lems caused by an incompatible set of 
components. 


The VerCheck Program 


I’m often faced with the task of walking 
customers through their WINDOWS\ 






SYSTEM directory and asking them to 
read to me version numbers of things 
like COMCTL32.DLL (as opposed to 
COMCT232.DLL and COMCTL32.0CX). 
I’ve also noticed that once you’ve gone 
through about three such version strings 
over the phone, users get a little tired. 
Hence, the program I present here, 
VerCheck, provides you with a list of the 
versions of all the components of rele- 
vance to your program. It is also con- 
venient for both technical support (the 
program requires no configuration or 
command-line arguments) and customers 
(it is quick and easy to use). 

I created VerCheck so that it was flexi- 
ble (the list of components to check is pro- 
vided in a text file), simple (users simply 
type VERCHECK), and easy to deliver (tech- 
nical support need only send users an 
archive containing two files). Since I want- 
ed it to run under Win32 without any sup- 
porting DLLs (which would have put me 
into a loop trying to check DLL versions 
for the version checker!), I opted to im- 
plement VerCheck as a Win32 Console Ap- 
plication. The complete source code and 
executables for VerCheck are available elec- 
tronically; see “Resource Center,” page 3. 


How it Works 

How do you retrieve the version number 
of a component of Windows? The answer 
lies in the resources attached to the com- 
ponent and a couple of Win32 API func- 
tions that are simple once you’ve read the 
documentation carefully. 

The version number for an EXE, DLL, 
or OCX is stored in a special file resource 
called the VERSIONINFO that stores more 
than one version number for the program. 
Table 1 shows the VERSIONINFO re- 
source for OLEAUT32.DLL (a common 
component that needs to be shipped with 
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many third-party OCX controls— it’s 
worth checking its version and that of 
OLEPRO32.DLL if you are experiencing dif- 
ficulties with an OCX). You can examine 
the version resource of a component by 
opening it in Developer Studio with the 
Resources option selected on the File 
Open dialog. 

The first group of elements are binary 
resources, and the second are text-based 
resources and are retrieved as strings. 
You'll notice that there are FileVersion and 
ProductVersion numbers in both sections. 


2, 20, 4054, 1 
PRODUCTVERSION = 2, 20, 4054, 1 
FILEFLAGSMASK Ox3fL 
FILEFLAGS Ox2L 
FILEOS VOS_NT_WINDOWS32 
FILETYPE VFT_DLL 
FILESUBTYPE VFT 2 UNKNOWN 





CompanyName Microsoft Corporation 
FileDescription 
FileVersion 
InternalN ame 
LegalCopyright 
LegalT rademarks 
OriginalFilename 
PrivateBuild 
ProductN ame 
Product¥ersion 
SpecialBuild 


2.204054 
OLEAUT32.0LL 
Copyright © Microsoft Corp. 1993-1996. 


2.20.4054 





the version number, represents the ver- 
sion of this individual component and 
should be checked when deciding 
whether a DLL or OCX needs to be up- 
graded. ProductVersion defines the ver- 
sion of the software product for which 
this component is designed. With system 
DLLs, this tends to be the version of the 
OS that the component was released with, 
or, as in the case of OLEAUT32.DLL, it is 
the same as the FileVersion. 

The binary equivalent of the FileVer- 
sion is FILEVERSION and is stored as a 
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Table 1: OLEAUT32.DLL VERSIONINFO resource. 
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WINDOWS NT 


e Secure path for future on Intel 
e IA-32 versions today 
e 1A-64 at first system ship 


FORTRAN 90 
e F90O Standard Conformance 


e HP, DEC, Cray & Microsoft Extensions 
e Super-scalar & Parallel Optimizer 


e Excellent run-time diagnostics 


C++ 


e ANSI Draft Standard 
e C++ targeted optimizations 


e Standard + Rogue Wave Libraries 


e Early dialects + ANSI C 


UNIX 


e Field proven for Intel, PowerPC, 
Sparc, Mips 


Third party marks and brands are the property of their respective owners 
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EDINBURGH PORTABLE COMPILERS 
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PRODUCTVERSION. All of these fields are 
described in the VERSIONINFO resource 
entry in the Developer Studio documen- 
tation. To provide simple information for 
version number checking, VerCheck reads 
the FileVersion entry for each DLL, OCX, 
or EXE that it is checking and writes it to 
a CSV file (use of the comma-separated 
file format means that the output of 
VerCheck can be read into programs like 
Microsoft Access or Excel for comparison). 
Since the FileVersion is a mandatory string, 
you can be guaranteed its existence (if the 
file has a VERSIONINFO resource). 
VerCheck functions by reading a file 
called VERCHECK.LST (the list of files to 


Figure 1: Sample VERCHECK.LST. 





| http://www.epc.com 


http://www.epc.co.uk. 


be checked), reads the VERSIONINFO re- 
source for each file, and writes the result 
to VERCHECK.CSV. To make the program 
easy to use, I’ve forced VerCheck to find 
VERCHECK.LST in ‘the current directory 
and to write VERCHECK.CSV to the same 
place. While this means you don’t have 
great flexibility, it also means that a tech- 
nical support representative does not have 
to explain as much of the operation of the 
program to a customer. 

To read the FileVersion, VerCheck 
makes three different calls to the Win32 
API: GetFileVersionInfoSize, GetFileVer- 
sionInfo, and VerQueryValue. GetFileVer- 
sionInfoSize retrieves the size of the 





Figure 2: Sample VERCHECK.CSV. 
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VERSIONINFO block in the file being 
checked. Since the VERSIONINFO block 
is of variable length, and its entries are 
also of variable length, there is no C 
structure that defines the block (although 
Microsoft supplies some pseudo-C to 
help explain the outline): Therefore, it 
is necessary to dynamically assign space 
for the structure. 

Once the space is malloc’d, you call 
GetFileVersionInfo, which returns a copy 
of the VERSIONINFO structure from the 
executable. GetFileVersionInfoSize and 
GetFileVersionInfo do not need the path 
of any DLLs and OCXes that are installed 
in the \WINDOWS\SYSTEM directory, 
they automatically search for them. Hence, 
you only need supply the name of any 
common components to retrieve the ver- 
sion information, Windows takes care of 
finding it. 

Once you have the structure in memo- 
ry, you need to find the FileVersion en- 
try. Win32 conveniently provides the Ver- 
QueryValue function for this, alleviating 
the need to traverse this variable-size struc- 
ture and return the string value. Ver- 
QueryValue can also be used to get Com- 
panyName, FileDescription, InternalName, 
LegalCopyright, OriginalFilename, Pro- 
ductName, and ProductVersion. The most 
important entry it returns is the Transla- 
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tion table that contains an array of lan- 
guage and char-set identifiers. 

To retrieve the FileVersion, you need to 
specify the language and char-set that are 
to be retrieved; so, you make two calls to 
VerQueryValue to retrieve the FileVersion; 
see Listing One (listing begins on page 102). 

The rest of the program checks return codes 
and allows users to type “VERCHECK /?” 
to get a help screen. A simple command- 
line parser lets users enter a different LST 
and CSV filename, but without any param- 
eters, the program uses the defaults. 

Figure 1 shows an excerpted VER- 
CHECK.LST file containing a list of com- 
mon components to be check (the last en- 
try shows the use of an explicit path to 
check a piece of the DAO libraries) and 
Figure 2 is an excerpt of output from my 
machine. 


Conclusion 

VerCheck is not a complex program, nor 
is its algorithm revolutionary, but it is an 
example of a utility that satisfies the pro- 
grammer’s need for accurate information, 
the customer’s desire to be interrupted as 
little as possible, and technical support’s 
need for simple solutions. 


DDJ 
(Listing begin on page 102.) 
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-------------------------------------------------------------------------- */ 
VERCHECK p void main( int argc, char * argv[] ) 
{ 


Listing One 


#include "stdio.h" 
#include "windows.h" 
#include "winver.h" 


// We indicate progress with printf 
// Needed for standard Windows definition 
// Needed for the version checking API 


// Input and output files with defaults in case the user does not specify any 
char sourceFile[ MAX_PATH ] "VERCHECK.LST"; 
char destinationFile[ MAX_PATH ] "VERCHECK.CSV"; 


/*----------------------+-----+---+-+- +--+ +++ +++ ++ ++ + 5 +--+ +--+ 5 - === === 
| procedure: printHelp 
| . 
| purpose: Prints program help 
---------------------------------------------------- = - = - = - - - - --- = = - = === == */ 
void printHelp( void ) 
{ 
printf( "Usage - vercheck [ ]\n" ); 
printf( " Source LST file\n" ); 
printf( " Destination CSV file\n" ); 
} 
/*%------------------------------------------------------------------------- 
| procedure: parceCL 
| purpose: Parses the command line for information 
---------------------------- = 5 - + - 5 5 5 = 5 = 5 - 5 5 5 = = = 5 = = - = - = = - == === - */ 
BOOL parseCL( int argc, char * argv[] ) 
{ 
// Check to see if there are any parameters, if any is /? then 
// we will print help and return FALSE, otherwise read the file names 
if ( arge == 2 ) 
{ 
strlwr( argv[1] ); 
if ( stremp( argv[1], "J?" ) == @ ) 
{ 
printHelp(); 
return FALSE; 
} 
} 
if ( arge == 3 ) 
{ 
strlwr( argv[1] ); 
strlwr( argv[2] ); 
strepy( sourceFile, argv([1] ); 
strepy( destinationFile, argv[2] ); 
} 
return TRUE; 
} 
/*----------------------------------------------------- = - === - -- = = - = - == ---- 
| procedure: main 
| purpose: This is where the program begins... 
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FILE * versionLST; // File handle for the list of files 
// Print the banner 
printf( "\nVersion Check v1.@\n\n" ); 
// Check parameters and return if /? 
if ( !parseCL( argc, argv ) ) 
return; 
// Check for the file VERCHECK.LST which we use to work out what 
// we are version checking 
versionLST = fopen( sourceFile, "r" ); 
// Tf the file is open then create the output file 
if ( versionLST ) 
{ 
FILE * versionOUT; // File handle for the output file 
// Create the output file 
versionOUT = fopen( destinationFile, "w" ); 
if ( versionOUT ) 
{ 
// Parse LST file and read version number of each file mentioned 
while ( !feof( versionLST ) ) 
{ 
// The file that we are going to check 
char versionFILE[ MAX_PATH ]; 
// Get the file name to be version checked 
if ( fgets( versionFILE, MAX_PATH, versionLST ) ) 
{ 
LPVOID version; // Storage for the version resource 
DWORD versionSIZE; // The size of the version structure 
DWORD dummy; 
DWORD c; 
// Tidy up the line 
c = strlen( versionFILE ); 
if (eos) 
{ 
versionFILE[ c - 1] = '\@'; 
printf( "Reading version of %s\n", versionFILE ); 
// Get the size of the files version information 
versionSIZE = GetFileVersionInfoSize( versionFILE, &dummy ); 
if ( versionSIZE > @ ) 
{ 
// Create space to receive the version number 
version = malloc( versionSIZE ); 


if ( version ) 

{ 
DWORD versionLENGTH; // Length of actual version 
LPVOID versionINFO; // Version string 
DWORD languageLENGTH; // Length of actual version 
LPWORD languageINFO; // Version string 


// Now read the version number of the file 
if (GetFileVersionInfo(versionFILE,@,versionSIZE,version) ) 
{ 
// Get actual version information 
if (VerQueryValue(version, 
"\\VarFileInfo\\Translation", 
&languageINFO,&languageLENGTH) !=@) 


char versionSTRING[ MAX_PATH ]; 
// Create string to retrieve resource, first 
// entry in language table is used 
sprintf (versionSTRING, 
"\\StringFileInfo\\%4.4X%4.4X\\FileVersion", 
languageINFO[@] , languageINFO[1]) ; 
if (VerQueryValue(version,versionSTRING, 
&versionINFO, &versionLENGTH) != 9) 
fprintf(versionOUT,"\"%s\",\"%s\"\n", 
versionFILE, versionINFO) ; 
else 
fprintf(versionOUT,"\"%s\", 
\"Failed (file version) \"\n",versionFILE) ; 
} 
else 
fprintf(versionOUT,"\"%s\", 
\"Failed (language) \"\n",versionFILE) ; 
} 
else 
fprintf(versionOUT, 
"\"%s\",\"Failed (version) \"\n",versionFILE) ; 
free( version ); 
} 
else 
printf( "Failed to allocate memory\n" ); 


else 
{ 
printf("Failed to get version info for %s\n", versionFILE) ; 
fprintf(versionOUT,"\"%s\",\"Not found\"\n", versionFILE ); 
} 
} 
else 
break; 
} 
} 
fclose( versionOUT ); 
} 
else 
printf( "Failed to create output file %s\n", destinationFile ); 


fclose( versionLST ); 
} 


else 
printf( "Failed to find %s\n", sourceFile ); 


DDJ 
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} 
| 
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if (error) 
return error; 
} 
— Dpsp->state = 1; /* It stopped */ 

Listing One psp->flags = procp->p_pfsflags; 

psp->events = procp->p_stops; 

/* stopevent () psp->why = procp->p_stype; /* why it stopped */ 
* Stop a process because of a procfs event; stay stopped until p->p_step is psp->val = procp->p_xstat; /* any extra info */ 
* cleared (cleared by PIOCCONT in procfs). break: 
+f case PIOCCONT: /* Restart a proc */ 

void if (procp->p_step == @) 

stopevent(struct proc *p, unsigned int event, unsigned int val) { return EINVAL; /* Can only start a stopped process */ 

p->p_step = 1; if (ap->a_data && (signo = *(int*)ap->a_data)) { 
do { if (signo >= NSIG |; signo <= @) 
p->p_xstat = val; return EINVAL; 
p->p_stype = event; /* Which event caused the stop? */ if (error = psignal(procp, signo)) 
wakeup (&p->p_stype); /* Wake up any PIOCWAIT'ing procs */ return error; 
tsleep(&p->p_step, PWAIT, "stopevent", @); } 
} while (p->p_step); procp->p_step = @; 
j wakeup (&procp->p_step) ; 
break; 
° .e default: 

Listing Two return (ENOTTY) ; 

/* New file sys/pioctl.h */ aes 0 

#include <sys/ioctl.h> } = j 

#if O 

struct procfs_status { 

int state; /* @ for running, 1 for stopped */ 
int why; /* what event, if any, proc stopped on */ 
unsigned int val; /* Any extra data */ 

” : ” JAVA VISUALIZER 

#else 

struct procfs_status { ee 

int state; /* Running, stopped, something else? */ Listing One 
int flags; /* Any flags */ 
unsigned long events; /* Events to stop on */ sheen eee ae. 3 
i ; * ent, if any, proc stopped on * Javarrofiter: :Galibac 
pera he ets /* Any odes ee : aa armas, H ear ie of the timer event 
‘ sg=0, reserve 
ae DWORD dwUser=0, // user instance data 
DWORD dwi=0, // reserved 
#define PIOCBIS _IOW('p', 1, unsigned int) /* Set event flag */ DWORD dw2=9) // reserved 
#define PIOCBIC _IOW('p', 2, unsigned int) /* Clear event flag */ ( ; 
#define PIOCSFL _IOW('p', 3, unsigned int) /* Set flags */ oe context object 
/* wait for proc to stop */ context; 

#define PIOCWAIT “TOR('p', 4, anes procfs_status) context.ContextFlags = CONTEXT_CONTROL; 

4define PIOCCONT _IOW('p', 5, int) /* Continue a process */ // get the control registers of the worker thread 
/* Get proc status */ HANDLE worker = TaskManager._workerThreadHandle; 

#define PIOCSTATUS _IOR('p', 6, struct procfs_status) sie Ores banter 

etThreadContext (worker, &context) ; 

#define S_EXEC @x00000001 /* stop-on-exec */ // store the PC 

#define S_SIG  @x®0000002 /* stop-on-signal */ JavaProf.add((INTEL_OP *)context.Eip) ; 

#define S_SCE  @x®@090004 /* stop on syscall entry */ } 

#define S_SCX  @x@Q0@00008 /* stop on syscall exit */ 

#define S_CORE @x@0@00010 /* stop on coredump */ ‘ae 

#define S_EXIT 0x@0000020 /* stop on exit */ Listing Two 


void JavaProfiler::start() 
if (!_procID) { // if we haven't already started . 
// find the maximum precision of the timer device 
TIMECAPS time; timeGetDevCaps(&time, sizeof(time)); 


Listing Three 


/* Part of procfs_vnops.c */ 


procfs_ioctl (ap) // start the quick timer tick profiler 
struct vop_ioctl_args *ap; _procID = timeSetEvent (max(_period,time.wPeriodMin), // delay in ms 
{ 0, // resolution (higher -> less overhead) 
struct pfsnode *pfs = VIOPFS(ap->a_vp) ; &Callback, // callback function 
struct proc *procp; @, // user-supplied callback data 
int error; . TIME_PERIODIC) ; 


int signo; 
struct procfs_status *psp; 


procp = pfind(pfs->pfs_pid) ; 
if (procp == NULL) { 


} 


switch (ap->a_command) { 
case PIOCBIS: 


procp->p_stops |= *(unsigned int*)ap->a_data; Listing One 
break; 
case PIOCBIC: BOOL CTransCtr1::OnEraseBkgnd(CDC* pDC) 
procp->p_stops &= ~*(unsigned int*)ap->a_data; { 
break; CWnd* pWndParent = GetParent(); 
case PIOCSFL: POINT pt; 
procp->p_pfsflags = (unsigned char)*(unsigned int*)ap->a_data; pt.x = pt.y = @; 
*(unsigned int*)ap->a_data = procp->p_stops; MapWindowPoints(pWndParent, &pt, 1); 
break; OffsetWindowOrgEx(pDC->m_hDC, pt.x, pt.y, &pt); 
case PIOCSTATUS: ::SendMessage( pWndParent->m_hWnd, WM_ERASEBKGND, (WPARAM)pDC->m_hDC, @ ); 
psp = (struct procfs_status *)ap->a_data; SetWindowOrgEx( pDC->m_hDC, pt.x, pt.y, NULL ); 
psp->state = (procp->p_step == @); return 1; 
psp->flags = procp->p_pfsflags; } 
psp->events = procp->p_stops; 
if (procp->p_step) { ane 
psp->why = procp->p_stype; Listing Two 
psp->val = procp->p_xstat; 
} else { <HTML> 
psp->why = psp->val = @; /* Not defined values */ <HEAD> 
} <TITLE>Test page for TransCt1</TITLE> 
break; </HEAD> 
cage PIOCWAIT: <BODY BACKGROUND="background. gif"> 


psp = (struct procfs status *) ap->a_data; <OBJECT ID="TransCtl" WIDTH=128 HEIGHT=128 
if (procp->p_step eect) { CLASSID="CLSID: B25D9AF5-E76@-11D@-A@52-@G@A@247B7657" 


error = tsleep(&procp->p_stype, PWAIT | PCATCH, "piocwait", 9); (continued on page 106) 
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(continued from page 105) 


CODEBASE="TransparentControl.d11l"> 
<PARAM NAME="ImageFile" VALUE="http://www.widgetware.com/image@1.bmp"> 


</OBJECT> 


Listing Three 


static HRESULT Download( T* pT, ATL_PDATAAVAILABLE pFunc, BSTR bstrURL, 


IUnknown* pUnkContainer = NULL, BOOL bRelative = FALSE) 


{ 
CComObject<COurBindStatusCallback<T> > *pbsc; 
HRESULT hRes = CComObject<COurBindStatusCallback<T> >::CreateInstance(&pbsc) ; 
if (FAILED (hRes) ) 
return hRes; 
return pbsc->StartAsyncDownload (pT, pFunc,bstrURL, pUnkContainer,bRelative) ; 


} 


Listing Four 


enum tagREADYSTATE 

{ 
READYSTATE_UNINITIALIZED 
READYSTATE_LOADING 
READYSTATE_LOADED 
READYSTATE_INTERACTIVE 
READYSTATE_COMPLETE 

} READYSTATE; 


ou uw wea 
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e ° e 
Listing Five 
// If we're still downloading the image, draw some text and return 
if ( m_nReadyState != READYSTATE_COMPLETE ) 
{ 


DrawString( di.hdcDraw, 
"Downloading bitmap...", 


&re ); 
return S_OK; 


® e e 
Listing Six 

// lf this is the first time through OnDraw, we need to setup the DIB 
if (! m_dib.IsInitialized() ) 


{ 
m_dib.SetBitmapInfoHeader( m_pbBuffer ); 


m_dib.Create( di.hdcDraw ); 
m_dib.SetBits( m_pbBuffer ); 


Listing Seven 


HDC hdcMask = ::CreateCompatibleDC( di.hdcDraw ); 
HBITMAP bmMask = ::CreateBitmap( columns, rows, 1, 1, NULL ); 
HBITMAP hOldMaskBitmap = (HBITMAP) SelectObject( hdcMask, bmMask ); 
SetBkColor( HDC( m_dib ), RGB( 255, 255, 255 ) ); 


SetTextColor( HDC( m_dib ), RGB( @, @, @) ); 
BitBlt( hdcMask, 2, @, columns, rows, HDC( m_dib ), @, @, SRCCOPY ); 


e e e 

Listing Eight 

BitBlt(di.hdceDraw, rc.left, rc.top, columns, rows, HDC(m_dib) ,®,@,SRCINVERT) ; 
BitBlt(di.hdcDraw, rc.left, rc.top, columns, rows, hdcMask, @, @, SRCAND ); 
BitBlt(di.hdeDraw, rc.left, re.top, columns, rows, HDC(m_dib) ,®@,@,SRCINVERT) ; 


DEVICE DRIVERS 


Listing One 


// Getting, claiming, and using resources with WinDK. 


// Get the resources from the registry 
CRegistry *pRegistry = new CRegistry(m_DriverRegPath) ; 


pRegistry->SetRelativePath (Concatenate (pDeviceName,L"\\Parameters") ) ; 
pRegistry->GetKey (L"PortBase", &m_PortBase) ; 

pRegistry->GetKey (L"PortRange", &m_PortRange) ; 
pRegistry->GetKey (L"Irq", &m_Irq) ; 

delete pRegistry; 


BOOLEAN £Conflict; 


CResource *pResources = new CResource(ISA,@,this,2); 
pResources->AddPortResource (m_MappedAddress) ; 
pResources->AddInterruptResource(m_Interrupt) ; 
pResources->AssignCardsResources (m_DriverRegPath, &conflict) ; 


delete pResources; 


// Accessing the I/O ports using WinDK 
WINDK_MAPPED_ADDRESS m_MappedAddress 


// map the device ports 


m_MappedAddress.Length = m_PortRange; 
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m_MappedAddress.PortType = PORT_MAPPED; 
m_MappedAddress.UnmappedAddress.LowPart = m_PortBase; 
m_MappedAddress.UnmappedAddress.HighPart = Q; 


m_MappedAddress.Flags CmResourceShareDeviceExclusive; 


// map I/0 address 
status = MapIoAddress(Isa, busNumber, m_MappedAddress) ; 


// Write to port 
WRITE_PORT_UCHAR ( (PUCHAR) (m_MappedAddress.pAddress + RESET_ALL) ,@); 


// Map an interrupt 
WINDK_INTERRUPT_RESOURCE m_Interrupt; 


// Setup an interrupt with WinDK 


m_Interrupt.Level = m_Irq; 

m_Interrupt.Vector = m_Irq; 

m_Interrupt.Affinity = 9; 

m_Interrupt.Flags = CmResourceShareDeviceExclusive; 
m_Interrupt.Mode = Latched; 


// DpcForIsr and Isr is static functions they can however use member functions 
InitializeInterrupt ( 
ISA, 
0, 
reinterpret_cast<PIO_DPC_ROUTINE> (DpcForIsr) , 
reinterpret_cast<PIO_DPC_ROUTINE> (Isr), 
this, 
m_Interrupt 
);// Claming resources with WinDK 
// The resources are automatically released in the destructor 


// Unmap the I/O address 
UnmapIoAddress (m_MappedAddress) ; 


Listing Two 


// Getting, claiming, and using resources with DriverWorks. Using the registry 
// with Driver::Works. Driver::Works has it's own way of defining devices in 
// the registry. The advantage of using there model is that 

// you can use the KConfigurationQuery class and CreateRegistryPath 

m_RegPath = KDevice: :CreateRegistryPath(L"MyDevice",m_Unit) ; 


KRegistryKey unitKey(*m_RegPath) ; 


if (NT_SUCCESS (unitKey.LastError())) 


{ 
unitKey.QueryValue(L"PortBase", &m_PortBase) ; 
unitKey.QueryValue(L"PortRange", &m_PortRange) ; 
unitKey.QueryValue(L"Irg", &m_Irq); 

} 


// Claming resources with Driver::Works 
KResourceRequest resReq(Isa,@,@) ; 


resReq.AddPort(m_PortBase, m_PortBase, m_PortRange, 2, 9, 
CmResourceShareDeviceExclusive) ; 

resReq.AddIrq(m_Irq,m_Irq) ; 

resReq. Submit (this,m_DriverRegPath) ; 


// Accessing the I/O ports using Driver: :Works 
KIoRange m_Ports; 


// BusType, Offset, I/0 port base, I/O port range 

fStatus = m_Ports.Initialize(Isa,0,m_PortBase,m_PortRange) ; 
// Write to the port 

m_Ports.outb(RESET_ALL,@) ; 


// Map an interrupt with Driver: :Works 
KInterrupt m_Interrupt; 


// Setup the interrupt 
m_Interrupt.Initialize(Isa,@,m_Irq,m_Irq,Latched, FALSE, FALSE) ; 


// The Isr can be a member of the device class, it can also be an static 
// member function to avoid the overhead of an extra function call. 


m_Interrupt.Connect (LinkTo(Isr) ,this) ; 


// Initialize a DPC object which may be queued by interrupt service routine 
InitializeDpcForIsr(LinkTo(DpcForIsr) ); 


// Release claimed resources 
KResourceRequest resReq(Isa,@,@) ; 


resReq.Release(this,m_DriverRegPath) ; 


// The destructor unmaps the memory automatically 


DDJ 
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FaxMan provides Windows Developers with a Plug- 
In programmable fax engine which instantly allows 
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or call 
800-955-8015 





300 Pensacola Road ¢ Burnsville, NC 28714 
704-682-4111 * Fax 704-682-0025 * BBS 704-682-4356 
Email: sales @ data-tech.com 


FaxMan is a trademark of Data Techniques, Inc. Windows is a registered Trademark of Microsoft, Inc. 


AD LINK 80 





UsE MAINWIN TO PorRT 
DN Geo) a Kom), 1 ).¢ 
PN] Loli an @) Ni hae VN 
TO REWRITE ONE LINE. 





AD LINK 91 


Fast. Flawless. Flexible. When you use 
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Michael Swaine 


don’t do benchmarks. I don’t do soft- 

ware evaluations. I don’t do hardware 

evaluations. I don’t do book reviews. 

Okay, I do, but I try not to, and if I 
have to, I don’t enjoy it. 

Rummaging through the dusty back 
shelves in the library at Stately Swaine 
Manor, I came across the December 28, 
1981 issue of InfoWorld, which contains 
a long and boring article entitled, “For our 
readers: /W’s hardware review guidelines.” 

Although this dreary stretch of prose is 
marked by no byline, I suppose I have 
to cop to having written it. At least I do 
remember being told to develop hardware 
review guidelines not long after I was 
hired at JW. I sure don’t remember writ- 
ing the article, though. 

The article went on for pages, big 11x14 
pages, detailing the format of each review 
and what each section of a review meant 
and the process we would follow and the 
ratings we would assign and what they 
meant. We developed equally verbose 
software review guidelines, written either 
by Thom Hogan (currently executive ed- 
itor of Backpacker magazine) or Scott 
Mace (now senior editor at Byte). Maybe 
one of them wrote this piece. 

“As far as we can tell,” wrote the hard- 
ware review guidelines author, whoever 
he or she may be, “/nfoWorld is the only 
publication that has attempted to create 
a set of standards and evaluate equipment 
and software based on them.” Maybe that 
was true. 

As it turned out, I had to write a lot of 
the hardware reviews myself. And a few 
software reviews. I never figured out how 
to make the reviews enjoyable, either for 
the reader or for the writer. 

So when I was offered the job of editor- 
in-chief of Dr. Dobb’s Journal in 1984, one 
of my first decisions was, “no reviews.” 


Adrift at C 
I wish I could say that I obeyed my own 
injunction. 


Michael is DDJ’s editor-at-large. He can 
be contacted at mswaine@cruzio.com. 
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Adventures in Benchmarking 
(My Don’t Do List) 


But shortly after I came on board the 
Good Ship DDj, I noticed that the C pro- 
gramming language was really getting hot. 
(This was several years ago.) There were 
a lot of vendors selling C compilers, and 
I thought that a comparative review of all 
of them would make a good feature ar- 
ticle. C was on the rise, and I wanted to 
catch the wave. I did have the sense this 
time to farm out the reviews to real pro- 
grammers, and I also farmed out the re- 
view guideline process, knowing that I 
didn’t know enough to do it properly. I 
chose wisely in the person to whom I 
gave the guideline project and the re- 
sponsibility for managing this multiprod- 
uct comparative review. 

I wasn’t so wise in scheduling the 
piece. There were a /ot of compilers. 

Before I knew it, we ran smack into the 
iceberg of the drop-dead deadline for the 
issue. 

Somehow or other, we made the thing 
work out. Then-managing editor Vince 
Leone got everybody into lifeboats and 
saved the day. The issue proved to be 
one of our most popular ever. 

It took its toll on me, though, turning me 
from a happy-go-lucky friend to all into 
the grouchy curmudgeon I am today. You’d 
think I’'d learn, but I continue to write re- 
views, recently for MacUser (until it mor- 
phed into MacUserWorld or whatever it 
calls itself). And I hated every minute of it. 

I still can’t answer the simplest ques- 
tions, like should I buy the old, cheap, 
soon-to-be-obsoleted model, or the new, 
expensive, soon-to-be-discounted model? 

But I keep doing it. I’m still doing it. I 
don’t know why. 


The Final Frontier? 

But I won’t review Userland Software’s 
Frontier scripting system. I’ve reviewed 
Frontier repeatedly over the years, and 
enough is enough. I do want to report, 
though, that grouchy curmudgeon Dave 
Winer and his loyal crew have released 
Version 5 for Mac and Windows in al- 


pha, perhaps even in release versions 


by the time you read this. The version 





numbering is version syncing; the Win- 
dows version is actually the first version 
for that platform. It’s nice that they re- 
leased for the two platforms almost si- 
multaneously. 

It’s news that Frontier is available for 
Windows. You may or may not recall the 
history of the product. 

Winer began work on Frontier long be- 
fore there was such a thing as AppleScript. 
He talked regularly with Apple insiders 
about the likelihood of Apple developing 
something that would compete with Fron- 
tier, and was repeatedly reassured by what 
he heard that he wasn’t being stupid to try 
to create a system-level scripting system 
for the Mac. Then he released Frontier and 
Apple released AppleScript, and Winer 
found himself in the unenviable position 
of trying to compete against a free, bun- 
dled product. He did the only thing he 
could, he took the high road, positioning 
Frontier as a more powerful AppleScript. 
That worked, but not well enough, and 
eventually Frontier became a free product, 
although Winer and his minions never 
stopped supporting and upgrading it. 

Now they’re trying to turn it into a com- 
mercial product again, and have increased 
their potential market by porting it to Win- 
dows. Exactly how they turned a Mac 
system-level scripting tool into a Windows 
ditto is something of a mystery to me, but 
if 1 explored that mystery it would be dan- 
gerously close to reviewing the product 
again. I'll leave it as a mystery. 


Rhap Sheet 
I cant review Apple’s Rhapsody operat- 
ing system. 

There’s a basic problem in evaluating 
software: You're given a beta version un- 
der a nondisclosure agreement and you 
expect the release version to be out by 
the time your column sees print. What 
can you say? What can I say about Ap- 
ple’s Rhapsody that isn’t embargoed and 
won't be false or misleading by the time 
the column sees print? 

I can say this: I wrote this column in 
the Blue Box. 
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The Blue Box is the Mac OS don’t-call- 
it-emulation component of Rhapsody. I'll 
say this: Even in this initial developer re- 
lease, Blue Box is sometimes slightly faster 
than the native MacOS. 

Yes, faster. 

Blue Box is still early beta software as 
I write this, so it crashes often: twice in 
the first half hour. But when Blue Box 
does crash, it’s just an app in Rhapsody 
that’s crashed; it doesn’t hang the whole 
machine. And in my experience, Blue Box 
relaunches slightly faster than the MacOS 
reboots. 

The machine Apple lent me has a dual- 
boot configuration; I can boot into the 
MacOS or into Rhapsody. Once in Rhap- 
sody, I can launch Blue Box and I’m in 
the MacOS, so there are two paths to 
Macishness. Although Rhapsody and Mac- 
OS have different disk partitions and file 
formats, I can read all my Mac disks and 
my MacOS partition once I’m in Blue Box. 
If this developer release supported Ap- 
pleTalk so I could print, I’d just do my 
Mac-based work in Blue Box. I can han- 
dle an occasional crash; hey, I use 
Netscape Navigator. 

Rhapsody itself hasn’t crashed once in 
the months I’ve been playing with it. 

I can also say this: Based on what I’ve 
seen, there are certain things I'll be look- 
ing for in the Premier release or the lat- 
er Unified release later this year. 

I'll be looking for decent performance, 
and I see reason for optimism about that. 
I'll be looking for solid tools for manag- 
ing servers, and I see no reason to think 
that these will be lacking. I'll be looking 
for smooth UI integration, and here I’m 
not so sure all the problems will be solved 
by the premier release. 

And I'll be looking for driver support. 

As I write this there’s a lot of chat about 
the lack of drivers, and of vendors think- 
ing about writing drivers, and of any kind 
of driver development kit from Apple. Ap- 
parently, this chat is a waste of bandwidth, 
since it’s way too early to think about de- 
veloping drivers. But I will be watching 
with interest to see if drivers arrive when 
it isn’t too early any more. 


Why the Server 

Market is Critical for Apple 

For Apple’s recent moves to pay off and 
the company to reemerge as anything but 
a niche vendor, Rhapsody has to become 
a player in the server market. Here’s my 
reasoning: 


e Apple has to hold onto its current mar- 
ket while growing. It won’t be possible 
to grow, to attract new customers or to 
open new markets, until the market- 
share slide is halted. Nobody wants to 
buy into a dying company. 


¢ To hold onto the current market Apple 
must fully support the MacOS. Jobs un- 
derstands that: The company has in- 
vested heavily in the MacOS in the past 
year, and brought out a good, if not 
earth-shaking upgrade in OS8. But that’s 
not the whole story. 
To fully support the MacOS, Apple must 
offer no alternative. Offering Rhapsody 
to the current user base would be an ad- 
mission that the MacOS is history, would 
dry up third-party development in a flash, 
and would signal customers that they 
were free to consider ai/ their options. 
e “The current user base” means those 
market segments where Apple still has 
respect and meaningful marketshare. 
That means education and content de- 
velopment, and that’s the turf Apple 
has to defend with its last, dying 
breath, using MacOS for now. Maybe 
for a long time. 
e New markets are a different story, and 
for Apple, servers are essentially a new 
market, with no appreciable market- 
share to lose. That’s where Rhapsody 
can drive a wedge. But the server mar- 
ket is competitive; can Apple win 
enough converts to Rhapsody to be- 
come a player? Read on. 
If Apple can’t make Rhapsody a player 
in servers, it probably will have missed 
its last chance to grow beyond the niche 
markets of education and content de- 
velopment. In that case, the company 
might be able to create a nice little niche 
business and hold onto or even grow 
marketshare in those two niches. Part- 
nering with other companies like Ora- 
cle, migrating more and more of Open- 
Step into the MacOS, some RedBoxish 
support for running Windows (or NT) 
apps— there might be a way. 
But if Apple can establish Rhapsody as 
a player in the server market (and stop 
the marketshare erosion in both total 
sales and its key markets), it’s a differ- 
ent story. Then developers will be 
drawn to Rhapsody (or Yellow Box) for 
the economy of development effort and 
the ability to write once and run on 
multiple platforms. Then the press will 
see Apple with two arguably success- 
ful operating systems and a corporate 
turnaround strategy that appears to be 
working. And then MacOS customers 
will begin to envy the Rhapsody users 
who have what they perceive to be the 
future mainstream Apple OS. Which is 
what it is intended to be. But first must 
come the customer perception. First 
must come the desire. 


For Apple to Win, Microsoft Must Lose 
So the question is, what will it take for 
Apple to make Rhapsody a player in the 
server market? 
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Protestations about not needing Micro- 
soft to lose notwithstanding, that’s a mar- 
ket currently targeted by Microsoft for NT. 
Apple’s apparent strategy of establishing 
a beachhead with Rhapsody in servers 
and then seeping onto the desktop (I 
promise that is the most mixed metaphor 
you'll find in this column) is not a bad 
one, but it’s also Microsoft's strategy for 
NT, and Microsoft has a lot more lever- 
age to make its strategy work. 

A lot of ill-informed people think NT 
has already won the server market, al- 
though that’s far from true. Nevertheless, 
carving out space in the server market 
means competing directly with Microsoft 
in a market it intends to dominate. There 
are only three ways I can see to sell into 
this market: Steal from UNIX, steal from 
NT, or catch new users. Since Microsoft 
is aggressively trying to do the first and 
the last, all three strategies put you square- 
ly in Microsoft’s gunsights. 

I think that the Jobs strategy is to com- 
pete by delivering better technology. A 
more solid operating system, better apps. 
And that implies another step in the pro- 
cess. Before Rhapsody can move to the 
desktop, it must become a player in the 
server market, yes. But for it to become 
a player in the server market, there must 
be compelling applications that are not 
just Windows or UNIX or MacOS(?) ports. 
So developers have to develop apps for 
the platform. (There are some great apps 
already, but there need to be no chinks 
in the armor of third-party software. The 
MacOS is losing marketshare chiefly be- 
cause of perceived holes in the knees and 
elbows of its software coverage.) 

Getting developers to develop for Rhap- 
sody in large numbers means giving them 
more than the hope of a market; many 
will want to see a sizable and plausible 
market right now. That’s where Yellow 
Box for Intel comes in. Write for Yellow 
Box and you can run on Yellow Box for 
Intel or on Rhapsody on PowerPC hard- 
ware. A huge market, a nifty development 
environment, a chance to develop apps 
that you couldn’t develop for Windows 
or the Mac. Maybe that’s a compelling 
story for developers. I think that’s the strat- 
egy. It sounds possible to me. It also 
sounds like a house of cards. I hope it 
doesn’t collapse. 

And I think that’s about all I can say 
about Rhapsody. 


No Prizes 
I don’t award prizes. I sometimes award 
no-prizes, in the tradition of Stan Lee, the 
creator of The Fantastic Four and Spi- 
derman, who was overlooked by the No- 
bel committee again last year. But I don’t 
award prizes. 

(continued on page 114) 
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WARNING: Programmers in general have 
determined that without Opus Make your 
company's health may be seriously jeopardized. 






In today's fast-paced world, time to market is everything. So why risk your company's health 
using a slow and unreliable build utility? You need Opus Make. It is the fastest, most full-featured, 
most reliable make utility on the market. Opus Make is designed to efficiently speed up your build 
process. You will develop your software faster, get it to market sooner and, in return, improve your 
company's health — and yours. Find out why Andrew Binstock says "Opus Make is the most 
compelling reason for not using the make utilities bundled with today's compilers." 


Opus Make version 6.1 features include: 


Makefile Compatibility: Processes Intersolv Configuration Other features: Multiple-directory 
Builder™, Microsoft™ NMake and VC++, Borland Make™, support @ Pattern-based inference rules @ 
and other makefiles. Multiple targets created from single 
Version Control Support: Access source files in Microsoft. Source ¢ Automatic and in-line response 
SourceSafe™, Burton Systems TLIB™, Intersolv PVCS™, and __ files ¢ Queued shell lines ¢ Object library 
MKS Source Integrity™ version control systems. maintenance ¢ Conditional, looping and 
Microsoft Visual C++ Support: Opus Make integrates with include directives * And much more! 
VC++ 2.x and 4.x. Opus MKMF V6.1 included : MKMF is our makefile 
Multiple Platform Support: and dependency generator. It quickly builds and 

DOS, OS/2, WinNT, Win95, AIX. maintains your makefiles. It understands C-preprocessor 

HP/UX. SCO, SunOS, Solaris, Irix,  ‘lirectives and resource compiler files. If you hate building 

DEC AlphaNT, and others. makefiles by hand this is the tool for you! 





Opus Software, Inc. © 1032 Irving Street, Suite 439 ¢ San Francisco, CA 94122 
Phone: (415) 485-9703 or 800-248-6787 © Fax: (415) 485-9704 
Email: info@opussoftware.com © Web: http://www.opussoftware.com 
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We're first again! 


Three Firsts in Test Tools for Java Developers! 


We were first with DeepCover™—The World’s First Test Coverage 
Tool for Java. DeepCover for Java reveals how well your code is tested at 
the application, class, method, and condition levels, easily and intuitively. 


We did it again with TotalMetric'’—The World’s First Static Metrics 
Tool for Java. TotalMetric for Java captures and displays traditional 

complexity/effort and advanced object-oriented metrics to help focus your 
Java testing. 


























And now again with AssertMate™—The World’s First Code Assertion 
System for Java. AssertMate for Java provides a fast and rigorous code 
assertion toolkit for programmers and class level testers. AssertMate 
supports data assertions, pre- and post-conditions, and invariants to 
validate code correctness in your Java programs. 


RST is dedicated to building the World’s first set of practical, early lifecycle 
testing tools for developers and testers alike. Identifying and removing bugs 
at the earliest stages of development is proven to have a major positive 
impact on the cost and quality of developing software. Our WhiteBox 
Software Assurance Tools™ provide software engineers the vital informa- 
tion they need to efficiently and effectively test their code. 








For more information on WhiteBox Software Assurance Tools for Java, 
and our introductory offer, visit: 
www.rstcorp.com/whitebox.html. 








Or contact us at: 

Tel: 703-404-9293 

Fax: 703-404-9295 
Email: info@rstcorp.com 
Internet: www.rstcorp.com 






System Requirements: RST Test Tools run on 
most UNIX and Windows-based platforms. 
All product names referenced herein are 
the trademarks or registered trademarks of 
their respective companies. 


Reliable Software Technologies 
When failure is not an option 
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= EXPERT ADVICE WHEN 
= YOU NEED IT MOST 


| OF Dobb's — Li rash 


ALGORITHMS | 





The Software Professional's Essential Resource 


The Dr. Dobb’s CD-ROM Library is an extensive collection of powerful tools and resources, essential to every programmers productivity 
and success. The library focuses on critical aspects of programming ranging from an expansive back-issue archive of Dr. Dobb's Journal to 
essential book selections on Algorithms and Data Structures and Graphics Programming. And coming from Dr. Dobb's, you know that 
each CD-ROM is chock-full of source code. Take a moment to browse through our library. We guarantee your satisfaction. 


ERIES 


D I. D 0 b b ‘s/ C D Re lease 5 | — OM — iG; 


Dr. Dobb’s/CD Release 5 provides you with immediate and complete access to nine-and-  danuary “ ns June 97 
a-half years of Dr. Dobb’s Journal. Every Algorithm Alley, every Swaine’s Flames, every | sf rr 1} ahh Ks 7 0D 
article, and every bit of source code from January 1988 through June 1997 isincluded on ff ; MOE RU 

this CD-ROM. In addition, Dr. Dobb’s/CD Release 5 contains all issues of Dr. Dobb's Sourcebook. | Release 5 
Bringing together a decade’s worth of perspectives and expertise from the most respected |i . brates oe: _. 
programmers in the industry, Dr. Dobb’s/CD Release 5 offers both long-time and first-time a -__. _ | 
readers an unparalleled programming resource. : nour OS. + Search Engine 


Use Release 5's powerful search engine to find what you are looking for and then print ("<= oe : a 
it, copy/paste it, or directly export it as a file. No more sifting through dozens of (New Release! 


back issues, and manually keying source code into your current programming pro- Fast search Engine 
ject. With Dr. Dobb’s/CD Release 5, you have it all when, where, and how you want it! 


Release 5 includes these must-have features: 


Super-fast full text, Boolean, search engine, indexes for both articles and authors, copy/paste/print functionality, save searches for 
quick access later, annotate and bookmark for future reference needs. 


Save $30.00! NEW REDUCED PRICE! $49.95 








$29.95 upgrade available to prior owners! To upgrade call: 800-822-1162 


IRDER TODAY! 
300-992-0549 


y.S. & Canada Go to our web site for more products from Dr. Dobb’s CD-ROM Library: 
2 » & “i A 


www.ddj.com/cdrom/ 





Basic 
Cryptanalysis 





Dr. Dops’s EssENTIAL BOOKS ON 
CryPTOGRAPHY AND Security CD-ROM 


The editors of Dr. Dobb’s Journal have selected the most important books on encryption 
technology and have compiled it into a resource every programmer must have. This CD-ROM 
presents both the theory and practice of network security implemented in C, Basic, 
and other familiar programming languages. 





'2 COMPLETE BOOK $1 







“Search Engine + Full Text 
*Source Code * Images 






Special Features include: Full-Text Search Engine, Complete text of all books, Hyperlinks 
across all books, Indexes for all books, Technical bulletins, Security briefs, and Cryptographic 
FAQs from RSA Data Security. 


The Essential Books on Cryptography and Security CD-ROM provides detailed 
documentation and analysis of: Classical and Modern Encryption Algorithms, 
Private-Key and Public-Key Cryptography, The Data Encryption Standard, 
Signature Schemes, Block Ciphers and Stream Ciphers, Plus Much, Much, More! 


4 gps aaesaaia ae 
P 








Due to U.S. Government Restrictions, this a eee 
CD-ROM is only available for U.S. and Canadian customers. 


Includes these books: 

1. Applied Cryptology, Cryptographic Protocols, and Computer Security Models by Richard Demillo 

2. Applied Cryptography: Protocols, Algorithms, and Source Code in C, Second Edition by Bruce Schneier 
3. Contemporary Cryptology: The Science of Information Integrity edited by Gustavus J. Simmons 

4. Cryptography and Data Security by Dorothy Denning 

5. Cryptography: A New Dimension in Computer Data Security by Carl Meyer 

6. Cryptography: Theory and Practice by Douglas Stinson 

7. Handbook of Applied Cryptography by Paul C. Van Oorschot, Scott A. Vanstone, and Alfred Menezes 
8-11 Military Cryptanalysis, Volumes I-IV by William Friedman 


12. RSA Laboratories FAQ on Cryptography, RSA Laboratories Technical Reports, RSA Laboratories Security 
Bulletins, and CrytoBytes Newsletter 


12 Gomplete Books ¢ Includes forward by Bruce Schneier 





JRDER TODAY! 
300-992-0549 


J.S. & Canada 
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Software Licensing with No 
Hardware Key and No Disk Key 


With CrypKey, you not only stop unauthorized use and 
duplication of your software, you can increase your sales! 
CrypKey truly adds features to your software. Offer usage by 
runs, time, enable or add options by phone, fax or e-mail! 
With 27 function calls, CrypKey offers you complete flexibility. 


Also available as CrypKey INSTANT 
Protects 16 or 32 bit Windows app's in 5 
minutes, with no coding! Just pick your 
protection options, like Ready-To-Try, click 
the EXE you wantto protect, and your software 
is ready to sell itself from your website, CD- 
ROMs or floppies with complete security! 















Ready-To-Try feature offers secure 
Web distribution of your software! 
Allows 1 trial period, like 30 days or 5 runs. 
Back-dating or even deleting the directory and 
re-installing doesn't fool CrypKey. Your first 
customer contact is when they buy! 


YES, It Really Works! 


Professional programmer evaluations agree: CrypKey 
offers the best combination of security, reliability, 
economy and ease of use with near-zero support. 


Supports Win3.x/95/32s/NT/95B(FAT-32), OS/2, 
DOS and intelligently manages network licensing on 
all Novell & Microsoft based networks. 


sLelth/Amect- lity ile tlelameierle-laiicsen 


Kenonic Controls Ltd., Calgary, Canada, Phone: 403-258-6200 Fax: 403-258-6201 e-mail: crypkey@kenonic.com 


CrypKey INSTANT is , Ready-To-Try for 30 days, with full security, 
just like your software can be! 











AD LINK 310 





114 








(continued from page 111) 

The Nobel committee does award 
prizes: 

Last year, once again, the committee 
awarded no Nobel Prize for hardware or 
software, but it gave the literature prize to 
Dario Fo, an Italian satirist who offends the 
church, the government, and everybody 
whose shirt shows any sign of stuffiness. 

Oh, and I don’t review books. Although 
I do write about them. 

That’s probably a distinction without a 
difference. What I mean is that I don’t fol- 
low anyone’s reviewing guidelines when 
I write about a book. I just say what 
seems to need to be said. 

What seems to need to be said about 
The Trouble with Dilbert, by Norman 
Solomon (1997, Common Courage Press, 
Monroe, ME; ISBN 1-56751-132-5), is the 
following: 


In his book attacking Dilbert cartoonist 
Scott Adams, Norman Solomon is guilty of 
exactly the sort of crass commercial op- 
portunism of which he accuses Adams. 
Solomon, who has written several cau- 
tionary books about politics and media 
bias, has capitalized on the popularity of 
Adams’ cartoon character to recycle a num- 
ber of his political chestnuts, clothing them 
as a critique of Adams and his creation. 


I don’t challenge Solomon’s claim that 
Adams is only in it for the money, since 
Adams says it himself. If, as Solomon 
claims, many people think that Adams is 
waging some sort of moral war against 
corporate evils, then those people are kid- 
ding themselves. Adams is, as he admits, 
an MBA who hit upon a clever money- 
making scheme, and is merchandising it 
for all it’s worth. 

Scott Adams is no Dario Fo. But he’s 
no faux Fo, either. 

Although I don’t mean any criticism of 
Solomon’s other books, which raise issues 
often papered over by the mass media, this 
book is cogno-intellectual blather. 

Dilbert, on the other hand, is funny. 
Not as funny as South Park, but funny. 


And Speaking of Criticism 
I don’t make mistakes...like some of my 
readers. 

Readers evaluate writers, I’ve noticed, 
although who invited this is beyond me. 

Andrzej Kozlowski, for example, wrote 
to point out with regard to my November 
1997 column that “Machiavelli was a prince 
only on the basis of the principle “You are 
what you write.” That, Andy goes on to 
say, “would make Dostoyevsky an idiot.” 

Oh, right. Like I’m going to take criti- 
cism from somebody who can’t even spell 
Andrew. 


Duh. 


DDJ 
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Tight Deadlines for 
Multithreaded 
Apps? 


with 





YOUR USERS want responsive apps, and 
YOUR BOSS wants the project finished on time. 
With Threads.h++, you can give your users 

the performance of multithreaded C++ apps and 
still meet your deadlines! 

Threads.h++ is YOUR COMPLETE SOLUTION 
for writing portable, multithreaded applications. The 
library eases the burden of multithreaded programming 
in C++ by encapsulating key multithreading features as C++ objects. Plus, its 
plattorm-independent programming model hides the differences in threading 
support across operating systems. Just write your applications once, then 
recompile them to run on Win NT/95, Solaris, HP-UX, AIX, IRIX, Digital 
Unix, and OS/2. 

Concerned about leveraging your single-threaded C++ code? Simply wrap 
your existing global and member functions with the Threads.h++ functor classes! 
Combine them with other classes in Threads.h++ and multithread using your 
existing code without the intrusive changes required by other libraries. 

To make things even easier, Threads.h++ provides a variety of high-level 
threading abstractions such as producer-consumer queues and JOUs. IOUs 
allow you to create asynchronous functions that can immediately return a 
representation corresponding to some “future” result. Producer-consumer 
queues coordinate communication between cooperating threads. 

To learn more about Threads.h++ and how it can speed the performance of 


your apps, visit our Web site at http://www.roguewave.com. 


SOFTWARE 


The Software Parts Company” a Rogue VWave 





WWW.FOGUueWaVe.COM 


Call us toll free in the U.S. at (888) 507-6483. 
Telephone us in Europe: ROGUE WAVE SOFTWARE GMBH: +49-6103-59 34-0 @ ROGUE WAVE SOFTWARE B.V: +31-20-416 06 57 
@ ROGUE WAVE SOFTWARE S.A.R.L.: +33-1-5568 1008 @ ROGUE WAVE SOFTWARE-UK. LTD.: +44-118-988 0224 





Rogue Wave and .h+~+ are registered trademarks, and Threads.h+~ is a trademark of Rogue Wave Software, Inc. All other trademarks are the property of Rogue Wave Software, Inc. or their respective holders. 
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Bill to: Bon Pitzer 
133 Southcenter ft 
Morrisville, NC 27568 


SKU# Qty Description 


[aszeano|y| 2  Heartytogtight 
19465) +) 6 _ Fireplace Goid Set 


'28735PL|+| 7? Silver Plated Silver Set 


(2190440 ©) 3 fireplace Complete Unit 


RCE ct Cee AEA SPIED ee 


(55489M0)| v | Crate Large Lights 


,956740¥i +) 4 Chimney Sweep Kit _ 


Special instructions 


1. inform customer SKU is on Back order 
2. Products shipped via fir 
4. Payment by card on file 


_Post and Print Re 
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PROGRAMMING 


The Persistent 


Template Library 


Al Stevens 


n two recent columns, I introduced my 

new friend, Marine Corps Major Buzz 

Lightyear. I met Buzz when I attended 

Jim McCarthy’s TeamworX BootCamp 
in May, 1997. Jim, Buzz, and I staged a 
reunion at Software Development ’97 East 
in Washington, D.C. I gave Buzz his nick- 
name at BootCamp because his attitude, 
enthusiasm, and occasional frustrations re- 
minded me of the character in Toy Story. 
Until now, I have not revealed Buzz’s true 
identity in this column because of an 
agreement with the BootCamp attendees 
that I would not use anyone’s real name. 
Buzz released me from that commitment, 
but I continued to observe the original 
promise; other members of the team 
might read this column and think I was 
breaking my word. 

I can now bring Buzz out of the closet 
(well, not in the traditional sense—I’m just 
going to tell you who he is). Actually, he 
did it himself with the help of none other 
than Bill Gates. If you attended Bill’s Fall 
Comdex keynote address or saw reports 
of it, you know he was joined on-stage by 
Marine Corps Major Jim Cummiskey and 
sports-legend Kareem Abdul-Jabbar. Major 
Jim Cummiskey, USMC, now famous far 
beyond the reach of this humble column, 
and Buzz Lightyear are one and the same. 
During his Comdex presentation, he man- 
aged to slip in the BootCamp buzzword, 
“resonate,” a reference that I’m sure res- 
onated with all the team members who 
happened to see it. 


A Persistent STL 

Three years ago this month, I interviewed 
Alexander Stepanov, the creator of the 
Standard Template Library, which the 
ANSI/ISO committee had accepted as part 
of Standard C++. We discussed STL and per- 
sistence, a property which, in the object- 


Al is a DDJ contributing editor. He can be 
contacted at astevens@ddj.com. 
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oriented lexicon, permits an object to out- 
live its creator (the program, not the pro- 
grammer) and to move from the space in 
which it was created to some other 
space — permanent storage, the memory 
space of other programs, or both. Persis- 
tence is usually implemented with disk 
files that store object memory, which per- 
mit programs to reinstantiate previously 
created objects, a concept that views tra- 
ditional database management from an 
object-oriented perspective. 

In our interview, I observed that STL 
does not implement a persistent object 
container model, and I asked Alexander 
to comment on STL and persistence. He 
responded: 


This point was noticed by many people. 
STL does not implement persistence for a 
good reason. STL is as large as was con- 
ceivable at that time. I don’t think that any 
larger set of components would have passed 
through the standards committee. But per- 
sistence is something that several people 
thought about. During the design of STL 
and especially during the design of the al- 
locator component, Bjarne [Stroustrup] ob- 
served that allocators, which encapsulate 
memory models, could be used to encap- 
sulate a persistent memory model. The in- 
sight was Bjarne’s, and it is an important 
and interesting insight. 


This statement caught my attention. I 
have long been interested in using pro- 
gramming languages to implement 
database-management software. In 1987, 
I published C Database Development, a 
book with code that implements a rela- 
tional database by using the features of 
the C language and its preprocessor. The 
first edition used K&R C. The second edi- 
tion, published in 1991, used Standard C. 
I called this software “CData: The Cheap 
Database Management System.” In 1992, 
I published C++ Database Development, 
a book that presented an approach I called 
“Parody: The Persistent, Almost Relation- 


ike T8 


an 





al Object Database Manager.” Parody, and 
its successor, Parody 2 (a 1994 second edi- 
tion that used newer C++ language con- 
structs), employed inheritance to imple- 
ment a persistent object mechanism. The 
technique works, but is somewhat cum- 
bersome and limited because it inherits 
the classes that produce persistent objects 
from a persistent base class. Many pro- 
grammers still use CData and Parody for 
their small database and object storage 
applications. 

Following my conversation with 
Stepanov, I watched the committee’s 
progress with the hope of expanding my 
persistent object ideas to use the STL con- 
tainer programming model— templates 
and generic programming — rather than 
inheritance. Persistent containers rather 
than persistent objects, it seemed to me, 
would better support the persistence needs 
of most programmers. I delayed starting 
work until the committee’s work was near 
completion and until at least one major PC 
compiler supported enough of Standard 
C++ and the STL to enable me to devel- 
op, implement, and publish my ideas. That 
time is now. The committee has approved 
Standard C++, and Microsoft Visual C++ 
5.0 implements most of Standard C++ and 
STL with only a few language enhance- 
ments remaining to be implemented — 
complete enough for me to get started. 

I looked at two other compilers. The 
gnu-win32 port of GNU C++ uses the orig- 
inal HP release of STL without default tem- 
plate arguments in the template class dec- 
larations. Borland C++ 5.2 uses a version 
of Rogue Wave’s STL that also does not 
employ default template arguments. Nei- 
ther of these implementations are close 
enough to the Standard C++ specification 
to work with what I have in mind. Micro- 
soft licensed P.J. Plauger’s STL imple- 
mentation, which is close enough to the 
Standard to test my ideas. There may be 
others on the PC platform. Eventually, all 
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compilers will comply with the Standard, 
and the code published with this column 
should work with those other compilers. 
There is not, as far as I can find from 
a search of the Web, a public-domain ob- 
ject database system that uses STL as its 
basis. This project tries to fill that void. 


iostream lterators 

Standard C++ addresses persistence in the 
STL by providing iostream iterators named 
istream_iterator and ostream _iterator. 
These iterators support forward sequential 
access only. By using them, you can write 
objects to and read objects from disk-based 
containers. Listings One and Two (listings 
begin on page 137) demonstrate this pro- 
cess. The first program, tstiosOl.cpp, con- 
structs a memory-based vector container 
of pseudorandom numbers and uses an 
ostream_iterator<long> iterator as an ar- 
gument to the STL copy algorithm to copy 
the memory-based container to a disk-based 
container. The second program, tstiosO2.cpp, 
constructs an empty memory-based vector 
container with enough space to hold the 
numbers in the disk file. Then it uses an 
istream_iterator<long> iterator as an ar- 
gument in the copy algorithm to copy the 
disk-based container into the memory- 
based container. 

File-based iterators are elegant indeed. 
Just the fact that they can be implemented 
and used so transparently highlights the 
genius (Stepanov) that underpins the 
generic programming model of STL. I see 
three drawbacks to using this approach 
for persistent containers, however. First, 
it takes a lot of code in the application 
to manage the disk container, code that, 
due to its complexity, should be encap- 
sulated once and reused. Second, the 
technique supports only those operations 
that can be performed with forward it- 
erators. Third, the disk containers record 
the objects as text strings by using the 
overloaded ostream::operator<< and 
istream::operator>> insertion and extrac- 
tion operators. Each object is separated 
by a text string specified as an argument 
to the ostream_iterator constructor, and 
you must make sure that the separator is 
something that properly separates input 
strings as recognized by the istream class. 
Objects not only use more disk space 
than they need to, they are variable in 
length depending on their data values. 
Unless you specify the setw manipulator 
or width member function for the disk 
file stream output, an integer with the 
value 12 takes two character positions 
(plus the separator string), and an inte- 
ger with the value 22334 takes five char- 
acter positions. Because of this architec- 
ture, you cannot use random access or 
bidirectional iterators with these disk- 
based containers. 
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Persistence and STL Allocators 

My first thoughts about a persistent STL 
were inspired by Alexander’s statement 
that allocators might be used to encapsu- 
late a persistent memory model. STL con- 
tainers are memory based. The simplest 
application of persistence would save a 
memory-based container on disk to be re- 
trieved later by an instantiation of that 
same container as identified by its file- 
name. The newly instantiated container 
would also be memory based. It could be 
any STL container built from a disk file 
when the container is constructed, and 
saved to the disk file when the container 
is destroyed. From that concept, I pictured 
the code in Listing Three. 

The PAllocator template class in Listing 
Three would encapsulate everything it 
needs to load and save a “templatized” 
container. The specialized IntAllocator class 
would exist only to provide a filename for 
the particular container, in this case a vec- 
tor of int objects. In the example, the file- 
name would be “IntAllocator.” You would 
not have to use the allocator class’s type- 
id name, however. Any string value would 
do. When the program instantiates the per- 
sistent container, the allocator allocates 
memory and loads the memory with ob- 
jects from the disk file. When the program 
destroys the container, the allocator saves 
the objects from memory back to the disk 
file to record any changes the program has 
made to the memory-based container. 

That was the idea. I really wanted to 
be able to implement an interface like the 
one in Listing Three, because it is central 
to the idea that you could instantiate a 
standard container and have it silently ex- 
hibit persistent behavior. Based on what 
Alexander said in that interview, I tried 
diligently to use allocators (and iterators) 
as the key mechanism to implement a per- 
sistent STL. 

Unfortunately, none of what I tried 
worked. It wasn’t Stepanov’s fault. In that 
same response, he observed that: 


In October 1994...there was strong inter- 
est [among the Object Database Manage- 
ment Group] to make the containers with- 
in their emerging interface to conform to 
STL. They were not looking at the alloca- 
tors as such. 


I should have paid closer attention. 

Here’s why it didn’t work. STL con- 
tainers typically instantiate an allocator ob- 
ject in the container’s constructor, although 
exactly how they manage that is an un- 
specified implementation detail. Each con- 
tainer has three or four overloaded con- 
structors. (Container adapter constructors 
specify container arguments. This discus- 
sion is about the constructors of the con- 
tainer classes not those of the adapter 
classes.) The first constructor initializes an 


empty container. The second constructor 
constructs a container of specified size 
and fills it with objects of a common val- 
ue. (The map, set, multimap, and multi- 
set containers do not have this construc- 
tor.) The third constructor copies objects 
from a range specified by two iterators, 
presumably that point to another con- 
tainer. The fourth is a copy constructor. 
There are no communications between 
the constructors and their allocators about 
initializing the containers’ allocated mem- 
ory with object values. Consequently, al- 
locators seem not to be the path to per- 
sistence after all, at least not yet. 

I thought I might make the persistent 
allocator concept work by partially spe- 
cializing each container to overload its 
constructors to communicate with the al- 
locator. Until PC compilers implement par- 
tial specialization, however, I cannot ex- 
periment with this approach. 


PTL 

Following all that failed experimentation, 
I caved in and decided to use inheri- 
tance— not to add persistence to object 
classes as Parody does, but to add per- 
sistence to containers. Persistent contain- 
ers, being derived from standard contain- 
ers, have to have their own names. 

I hereby introduce PTL, the Persistent 
Template Library, Version 1. The header 
file ptl.h (available electronically; see “Re- 
source Center,” page 3) implements the 
template classes that constitute PTL. There 
is one class per STL container and some 
base classes. The naming convention for 
the public interface prefixes the STL con- 
tainer name with PTL. Consequently, a 
persistent vector is the template class 
PTLvector, and so on. The top base class 
in the template hierarchy is the P7Zcntr 
class, which is derived from whatever STL 
container class is involved. Intermediate 
classes encapsulate operations common 
to more than one of the containers. For 
example, PTIvector, PTLdeque, and PTTI- 
list are derived from P7TIseq. 

Let’s start with an example. Tstptl01.cpp 
(available electronically) declares a PTIvec- 
tor<int> container. Constructors for PTL 
containers specify their filenames. The li- 
brary adds the file extension .ptl to the file- 
name. The declaration instantiates a mem- 
ory-based vector container that contains 
whatever was stored in the vector the last 
time one was instantiated with that particu- 
lar filename. The first time you run the pro- 
gram, the file does not exist, and the dec- 
laration instantiates an empty vector. The 
program uses the push_back function to add 
integers to the vector. When the P7Zvector 
object is destroyed, the memory-based vec- 
tor is written to the disk-based container. 

Tstptl02.cpp (also available electroni- 
cally) instantiates the same PTLvector<int> 
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container by using the same filename as 
the constructor argument. This declaration 
results in a vector container that is initial- 
ized with the values from the disk file. The 
program declares an iterator of type P7Ivec- 
tor<int>:.iterator. This iterator is not spe- 
cific to disk-based containers. It is the stan- 
dard vector<int>::iterator object that STL 
declares. The PTL notation works because 
PILvector<int> is indirectly derived from 
vector<int>. Accessing the container after 
it is instantiated uses STL conventions be- 
cause the container is a standard memory- 
based STL container. If the program 
changes the contents of the container (it 
does not in this example), the contents 
are saved to the disk file when the con- 
tainer is destroyed. 

There are other example programs to 
demonstrate all the PTL containers. I'll dis- 
cuss two of them later in this column. 
They all are available electronically. 


To Save or Not to Save 

Currently, the base P7Lcnir class assumes 
that it should always save the contents of 
the container on disk when the contain- 
er object is destroyed. I considered over- 
loading all the STL container functions 
that can change the container—operator//, 

insert(), push(), and so on— and saving 
only if one of them was called at run time. 
I might do that yet, but for now, PTL saves 
the container automatically unless you call 
the suppress_save function. 


PTLStore 

Ptlstore.h and ptlstore.cpp implement the 
PILStore class to manage disk input and 
output of all persistent containers. Ptl- 
store.h overloads the << and >> operators 
with template functions that read and write 
objects of types that have flat memory data 
representations. These overrides use the 
sizeof operator and the object address to 
store the object’s memory image on the 
disk. This approach works with all C++ 
intrinsic types and some user-defined 
types, which I'll address next. Ptlstore.cpp 
includes specialized insertion and extrac- 
tion operator functions for string objects. 
These functions store the length of the 
string data and the string characters. 


Preparing a Class to be Persistent 

The examples discussed so far use con- 
tainers of intrinsic types. PTL works with 
user-defined types if the types are STL- 
compatible and if they conform to certain 
conventions. First, if the class has no vir- 
tual functions (no uptr), no pointers, no 
references, and no embedded objects that 
have any of these things, the class is al- 
ready PTL-compliant because its data rep- 
resentation is a flat memory image of the 
data values— a C struct without pointers. 
Otherwise, if the class refers to other mem- 
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ory or has a uptr, the class has to partici- 
pate in its own persistence by first over- 
loading the insertion operator to save suf- 
ficient data to reconstruct the object and 
then overloading the extraction operator 
to do the reconstruction. The overloaded 
string insertion and extraction operator 
functions in ptlstore.cpp are examples of 
this procedure. 

Date.h, the header file for a Date class, 
has a virtual destructor, so Date objects 
cannot be automatically saved and restored 
by the P7ZStore class. Date.cpp includes 
overloaded insertion and extraction oper- 
ators for both iostreams and PTLStore. You 
can see how similar they are. But the PTI- 
Store functions use P7LStore’s template 
functions to store intrinsic values, which 
work with memory images rather than 
character strings. Note that even though 
the Date class’s data representation is a 
long integer to represent the number of 
days since 1/1/1, their persistent repre- 
sentation uses day, month, and year inte- 
ger values. This approach permits some- 
one to modify the Date class for a different 
data representation without having to 
make data conversions on all the PTL disk 
files that store dates. 

Brothers.h is a header file that defines 
a predicate function to be used by a map 
container to compare string objects with 
a case-insensitive compare. Brothers.h also 
provides a typedef to give the shorthand 
alias, Brothers, to the P7Lmap<std::string, 
Date, ltstr> class just to keep the code as 
brief as possible. Tstptl13.cpp and 
tstptll4.cpp use the Brothers alias to in- 
stantiate P7Zmap containers keyed on 
string values and returning Date values. 


Cross-Container Stores 

The disk files for the containers share a 
common format. The file begins with an 
integer value that specifies how many ob- 
jects are in the container followed by the 
object values in the sequence in which they 
are to be inserted into the memory-based 
container when it is reinstantiated. This 
common format means that you can con- 
struct a persistent vector container and use 
its filename to instantiate a list container, 
for example. There are two exceptions: 
First, maps and multimaps contain two ob- 
jects of possibly different types for each 
container entry, so those files are not in- 
terchangeable with the other containers. 
Second, multiset and multimap containers 
are not compatible with set and map, un- 
less you are willing to forsake the dupli- 
cate entries when you instantiate the non- 
multivalue memory-based containers from 
the multivalue disk-based containers. 


PersistentlemplateLibrary Namespace 
PTL is enclosed in the PersistentTem- 
plateLibrary namespace. Each program 
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that uses PTL starts with a namespace 
alias statement to shorten the notation to 
ptl. Why didn’t I simply use “ptl” as the 
namespace? Well, there is a better chance 
that no one else has used the longer 
name. Programmers use namespace alias- 
es for brevity and can choose one that is 
not in the current scope. Library builders 
have no way of knowing what other li- 
braries a programmer is likely to use. I 
asked two prominent committee mem- 
bers if they knew of any coordinated ef- 
fort to build an international namespace 
registry. Apparently, no such effort is un- 
derway. P.J. Plauger likes “...Sun’s trick 
with Java libraries: if you own the domain 
plauger.com, you have all rights to the 
package names COM.plauger.*,” a natu- 
ral construction for an Internet-based lan- 
guage. Bjarne Stroustrup says the reign- 
ing mechanism is “...if you call your 
namespace AT&T or Microsoft, you are 
likely to get into trouble,” which puts man- 
agement of namespace registration into 
the hands of the lawyers. Registry by saber 
rattling. 

In any event, for now I’m using Persis- 
tentTemplateLibrary. If that name collides 
with yours, tell me, not your lawyer. I'll 
keep changing it until I come up with 
something that no one else would dream 
of using. 
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What's After PTL? 

PTL has a limitation— the real contain- 
ers are memory based. The disk-based 
containers exist only to support persis- 
tence. You would not use PTL as a sub- 
stitute for a large object database. My 
goal is to implement disk-based sequen- 
tial and associative containers that con- 
form to the STL interface and that offer 
some of the functionality of a relational 
database. I’ll be looking at some of the 
low-level Parody code to see what I can 
reuse. To support bidirectional and 
random-access iterators, such containers 
must provide a way to retrieve an object 
by its position in the container, which 
means that the container storage must 
use fixed-length object images or that the 
container provides an efficient way to lo- 
cate an object’s disk image from some 
position-related index. 

Memory-based objects are always fixed 
length as reported by the sizeof operator. 
Their members, however, can point to dy- 
namically allocated memory of different 
sizes for different objects of the same class. 
The standard string class is an example. 
The sizeof operator will always return the 
same value irrespective of the length of 
the string, which is stored on the heap. 
An efficient object store would use a fixed- 
length memory image for two reasons — 
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first, so that retrievals based on position 
could simply seek to the position as a 
function of the image size; second, so that 
the program could rewrite an image over 
the disk position it previously held and 
delete objects by adding their fixed-length 
disk record space to a free space list. 

One approach I am considering is the 
development of a persistent heap to con- 
tain the variable-length members of a 
fixed-length object. The persistent heap 
must be able to associate fixed object iden- 
tifiers with allocated blocks so that the 
heap can be reorganized, concatenating 
associated storage and eliminating frag- 
mented free space to optimize the heap’s 
performance. Something similar to the MS- 
DOS FAT system without the fat. 

Not all memory-based container data 
structures would be well-served by disk- 
based containers. Inserting and deleting 
objects in a large disk-based vector con- 
tainer would be an inefficient operation, 
for example, because of the shifting of ob- 
jects. This project has to consider these 
factors and arrive at solutions that make 
sense, not merely strive for disk-based ob- 
ject stores that mimic the behavior of STL 
containers. 
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Help create the fastest-growing 
Online service in the world. 


Honored by FORTUNE magazine as one the coolest companies to work for, and more recently 
highlighted by HotWired as a “Dreamjob” company, Juno Online Services is the nation's second-largest 
e-mail provider, supplying more than 3.5 million members with free e-mail since our launch in late 
April of last year. Currently we are seeking: 


Senior Quality Assurance Engineers Junior/Mid Quality Assurance Engineers 

You would assume a senior role in the quality You would assume a junior role in our quality 
assurance efforts of our Internet products. You assurance team. You would be assigned to either 
would be interacting with the client and the the client or the server development groups, and 
server development groups, formulating and work with them in a test engineering capacity, 
executing various testing strategies. while being guided by the senior quality 


assurance engineers. Should you develop interests 
° Ideally, 3+ years of relevant experience. and aptitude in roles outside quality assurance, 
¢ C, C++ and OOD skills, desired. 


you may be transferred to another area in the 


company after a given time. 
¢ Experience with various testing tools, such 


as OA Partner, SOA Team Test, etc. * 0-2 years of experience. 


* Experience with regression testing, ¢ Experience with C. 


functionality testing, etc. 
nctionality testing, etc * Experience with C++ and OOD, highly 


¢ Experience with formulating and writing desired and nurtured. 
test plans, modules and scripts. 


¢ Knowledgeable in the UNIX and the 
Windows platforms. 


* Experience with the UNIX and NT platforms. 


* Familiarity with the Internet technologies, 
desired. 


° E ith Syb d SOL, desired. 
EEDA. Maayan * Degree in Computer Science, highly desired. 
¢ Familiarity with the Internet technologies, 
pean aie camiumics * Experience with GUI development and/or 

) systems programming, desired. 


¢ Degree in Computer Science, desired. 


We are also looking to hire exceptionally talented computer scientists, systems and network engineers, 
database designers, WWW programmers and other software professionals to join our world-class 
technical staff. 


E-mail your resume to: qajobs@recruit.juno.com AD LINK 411 
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How Do | Implement 
Callbacks with Java’s RMI? 


Govind Seshadri 


ava’s Remote Method Invocation (RMD 

brings distributed- object computing 

to Java. With RMI, Java objects are not 

restricted to operating solely within 
the confines of the host JVM. Rather, they 
can freely roam the Internet, seeking re- 
mote object implementations to service 
their needs. 

Simply having remote objects whose 
methods can be invoked by distributed 
client objects is not sufficient for complex 
interactions, however. For instance, ap- 
plications supporting any kind of collab- 
orative functionality need to implement 
peer-to-peer relationships between inter- 
acting client and server objects. But how 
do you enable the server object to com- 
municate with the client object in a truly 
asynchronous manner? While there are 
several approaches to implementing peer- 
to-peer asynchronous communication, one 
of the more elegant techniques involves 
the use of callbacks. 

Event- driven programming in languages 
like C has traditionally used function point- 
ers to pass references to functions, which 
are then asynchronously invoked in re- 
sponse to an “event” (timers, mouse clicks, 
and the like). Java uses interfaces to give 
you access to the same functionality in an 
object-oriented world. Here, the interface 
defines the methods that may be invoked 
by any object with access to the interface. 
The real “functionality” is present within 
some other object that actually implements 
this interface and it is “called back” by the 
target object. 

Implementing callbacks, however, is not 
without challenges— particularly in a 
distributed-object environment like RMI. 
Callbacks are implemented in RMI by set- 
ting the client itself as an exportable re- 
mote object (that is, make it implement 
some remote interface), then registering a 
reference of this with the remote-server 
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object. This way, the server can asyn- 
chronously invoke the remote methods of 
any connected client in the same way the 
client can asynchronously invoke the 
methods implemented within the remote- 
server objects. 

In this article, I'll examine the intricacies 
of enabling true peer-to-peer RMI interac- 
tion and present a step-by-step approach 
to implementing callbacks. For more in- 
formation on RMI, see “How Do I Use Java 
Remote Method Invocation from an Ap- 
plet?” by Cliff Berg (DDJ, March 1997). 


Implementing Callbacks 

The steps required to implement RMI call- 
backs have much in common with those 
needed to implement a simple remote- 
server object. The key difference is that, 
with RMI, you also have to set up the 
client applet or application as a “remote 
object,” and register its reference with the 
remote-server object. 

To illustrate how this works, I’ll imple- 
ment an application using RMI callbacks. 
In this example, the remote-server object 
(QuoteServer.java) maintains an updatable 
list of quotes. The RMI client applet 
(QuoteApplet.java) can make additions to 
this list in an asynchronous manner at any 
time. In the process, I'll demonstrate the 
use of RMI callbacks by having the serv- 
er object periodically select a quote from 
its list in random fashion, and update each 
of the connected client objects in an asyn- 
chronous manner. 


Specify the remote-server interface. 
The remote-server interface serves to spec- 
ify the functionality implemented by the 
server object. This functionality is then 
made accessible to a client via remote 
methods invocations. 

The remote-server interface (see Listing 
One; listings begin on page 137) lists two 
methods available to client RMI objects. The 
addQuote method is used by remote clients 
to update the list of quotes, and the setQ- 
ClientInterface method is used by clients to 
“register” themselves with the server object. 





Specify the remote- client interface. 
Any RMI client applet/application that ex- 
ports itself as a remote object needs to 
implement a remote-client interface. This 
is what enables the server object to in- 
voke the client functionality in an asyn- 
chronous manner via callbacks. 

The refreshClient method (see Listing 
Two) defines how each remote-client ob- 
ject can be “notified” or updated by the 
server object asynchronously. 

Develop the remote-server object by 
implementing the remote-server in- 
terface. If you have prior experience with 
RMI, you'll notice that nothing really new 
is happening in the implementation of the 
remote-server object in Listing Three. The 
server object maintains two updatable Vec- 
tor objects: one of the quotes, and the oth- 
er to maintain references to all the con- 
nected remote RMI client objects. The 
remote-server object also demonstrates 
the use of callbacks in making asyn- 
chronous updates to the client object. 

Develop the client application or ap- 
plet by implementing the remote- 
client interface. The mypackage.rmiex- 
amples package in Listing Four illustrates 
how you can implement the client appli- 
cation/applet using the remote-client in- 
terface in step 2. 

Compile the Java source files for the 
RMI client and server. Since the RMI 
client applet is sent across the Internet by 
the web server, it is necessary that the ap- 
plet and all stub files and interfaces be ac- 
cessible in some location under the doc- 
ument root directory of the web server; 
for example, create a directory “rmi” some- 
where within the $DOCUMENT_ROOT 
for your web server. Since the Java class- 
es— including the RMI client and serv- 
er— are part of the package mypackage 
miexamples, you will have to create the 
subdirectories $DOCUMENT_ROOT/ 
rmi/mypackage and $DOCUMENT_ROOT/ 
rmi/mypackage/rmiexamples. 

Assuming you have all the Java source 
files within $DOCUMENT_ROOT/rmi, 
you can compile them by going to that 
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directory and executing Example 1. This 
process creates the package mypackage 
miexamples containing the classes OClient- 
Interface.class, QServerInterface.class, Quote- 
Server.class, and QuoteApplet.class. 





Example 1: Compiling Java source 
files. 


input Quote: 


Climb mountains te see lowlands: 162 

you want the rainbow, put up with the rain: 163 
Climb mountains to ses lowlands: 164 

Climné mountains to see lowlands: 165 

Smooth seas do not make skilful sailors: 166 






Figure 1: The RMI client applet. 


server remote objects. 


Example 2: Generating the server skeletons and client stubs for the client and 


Generate the stubs and skeletons for 
the RMI client and server objects. From 
within $DOCUMENT_ROOT/rmi, invoke 
Example 2, which generates the server 
skeletons and client stubs for the client 
and server remote objects within $DOC- 
UMENT_ROOT/rmi/mypackage/rmiex- 
amples. 

QuoteServer_Stub.class and QuoteServ- 
er_Skel.class are the client stubs and serv- 
er skeletons for the remote-server object. 
QuoteApplet_Stub.class and QuoteAp- 
plet_Skel.class are the same for the client- 
remote object. 

Start the remote-server object. Start 
up the remote-server object from within 
$DOCUMENT_ROOT/rmi using java my- 
package.rmiexamples. QuoteServer &. 

Run the client applet after creating 
an HTML file for the RMI client. Since 
Java applets are embedded within an 
HTML file, make sure that you create an 
HTML file (say, rmi.html) to send the 
QuoteApplet.class to the browser or ap- 
pletviewer. 





Since support for JDK 1.1 is still evolv- 
ing within the major browsers, test your 
client applet with the JDK 1.1 applet- 
viewer. You should be able to bring up 
the RMI client applet with your ap- 
pletviewer using appletviewer http:// 
hostname/rmi/rmi.html. Figure 1 shows 
an RMI client applet executing within 
the appletviewer. 


Summary 

Callbacks are not just an elegant comput- 
ing paradigm— they are also a prerequi- 
site for enabling peer-to-peer asyn- 
chronous communication. Here, I’ve 
presented one way you can easily imple- 
ment callbacks within RMI. Although there 
may be other mechanisms for imple- 
menting RMI callbacks, they all rely upon 
the basic principle of setting up the RMI 
client as an exportable remote object, then 
passing its reference to the remote-server 
object. Finally, note that RMI applications 
utilizing callbacks cannot be used across 
firewalls because RMI makes use of HTTP 
tunneling; to get through firewalls, remote- 
client object references cannot use the 
stateless HTTP protocol for persistence. 


DDJ 
(Listings begin on page 137.) 
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Exponential Smooth 


William Stallings 














ou often need to guess the next val- 
ue in a time series based on previ- 
ous values in the time series. For ex- 
ample, the nonpreemptive scheduling 
policy known as “shortest process next” 
(SPN) selects the process with the shortest 
expected processing time (before it is 
blocked by an I/O or system call) to run 
next. For this purpose, the operating sys- 
tem keeps a running average of all of the 


previous processing bursts for each pro-— 


cess, and uses these averages to estimate 
the next bursts. Running averages for esti- 
mating future values also show up in many 
areas of communications protocol design. 


Smoothing Techniques 

Suppose you have a series of measured 
values V(1), V(2), V(3),..., where V(@) is 
the value observed at the 7th observation. 
Assume you want to predict (estimate) the 
value V(K+1) based on the value up 
through V(K). One approach would be 
simply to take the average of observed 
values, as in Figure 1(a), where AV(K) is 
the average of the first K values. The es- 
timate for the next value in the series, 
V(K+1), is equal to AV(K). 

Figure 1(b) shows the expression refor- 
mulated so that it is not necessary to re- 
calculate the entire summation each time. 

Each term in the summation is given 
equal weight; that is, each term is multi- 
plied by the same constant 1/K. Typically, 
you would like to give greater weight to 
more recent instances because they are 
more likely to reflect future behavior. A 
common technique for predicting the next 
value on the basis of a series of past val- 
ues is known as “exponential averaging,” 
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or “exponential smoothing.” Figure 2(a) 
shows how you compute SV(K), the 
“smoothed estimate.” 

Compare Figure 2(a) with Figure 1(b). 
By using a constant value of a(0<a<1), 
independent of the number of past ob- 
servations, you have a circumstance in 
which all past values are considered, but 
more distant ones have less weight. To see 
this more clearly, consider Figure 2(b). 
Since both @ and (1-@) are less than 1, 
each successive term in the preceding 
equation is smaller. For example, for o=0.8, 
the expansion is Figure 2(c). The older the 
observation, the less it is counted in the 
average. 

Figure 3 shows the size of the coefficient 
as a function of its position in the expan- 
sion. The smaller the value of «, the greater 
the weight given to the more-recent ob- 
servations. For &=0.5, virtually all of the 
weight is given to the four or five most re- 
cent observations, whereas for a=0.875, 
the averaging is effectively spread out over 
the ten or so most recent observations. The 
advantage of using a small value of & is 
that the estimate will quickly reflect a rapid 
change in the observed quantity. The dis- 
advantage is that brief surges in the value 
of the observed quantity will result in jerky 
changes in the smoothed average. 

Figure 4 compares simple averaging 
with exponential averaging (for two dif- 
ferent values of ). In Figure 4(a), the ob- 
served value begins at 1, grows gradual- 
ly to a value of 10, and then stays there. 
In Figure 4(b), the observed value begins 
at 20, declines gradually to 10, and then 
stays there. In both cases, you start out 
with an estimate of SV(1)=0. Exponential 
averaging tracks changes in process be- 
havior faster than does simple averaging 
and the smaller value of @ results in a 
more rapid reaction to the change in the 
observed value. 


TCP Retransmission Timer 

In the Transmission Control Protocol (TCP), 
two TCP entities exchange segments over 
a TCP connection. Each side retains a copy 
of each segment that it sends and, if an ac- 
knowledgment is not received within a re- 
transmission timeout (RTO) interval, re- 
sends the segment. Thus, if a segment is 
lost in transit, it will automatically be re- 
sent. If RTO is set too small, then a TCP 
entity may retransmit a segment for which 
an acknowledgment is on its way but de- 
layed because of network congestion; such 
unnecessary retransmissions actually wors- 
en the congestion. On the other hand, if 
RTO is too large, the protocol will be slug- 
gish in responding to a lost segment. 

Initially, TCP used a smoothed average 
to estimate the round-trip time. However, 
this didn’t work well if the round-trip time 
varied significantly. To cope with this, Van 
Jacobson proposed a refinement in the 
standard algorithm that has now been of- 
ficially adopted for TCP. Figure 5 sum- 
marizes this algorithm. 

The Van Jacobson algorithm provides an 
improvement by estimating both the round- 
trip time (RTT) and the standard deviation 
of the RTT. The RTO value is then set to 
the estimated value of RTT plus a constant 
multiplied by the estimated standard devi- 
ation. This enables the use of more rea- 
sonable values of the retransmission timer. 
Standard implementations of TCP use the 
parameters g=1/8=0.125, b=1/4=0.25, and 
f=4. Experience has shown that Van Ja- 
cobson’s algorithm can significantly improve 
TCP performance. 


ATM ABR Service 

Support for bursty data traffic on ATM net- 
works, such as traffic generated by TCP/IP- 
based applications, has traditionally been 
carried on the so-called Unspecified Bit 
Rate (UBR) service. UBR is designed to 
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make use of available capacity on the ATM 
network not consumed by time-sensitive 
traffic such as voice and video. All of this 
unused capacity could be made available 
for the UBR service. This service is suit- 
able for applications that can tolerate vari- 
able delays and some cell losses, which 
is typically true of TCP-based traffic. With 
UBR, cells are forwarded on a first-in-first- 
out (FIFO) basis using the capacity not 
consumed by other services; both delays 
and variable losses are possible. No ini- 
tial commitment is made to a UBR source 
and no feedback concerning congestion 
is provided; this is referred to as a “best- 
effort service.” 





To improve the service provided to 
bursty sources that would otherwise use 
UBR, the Available Bit Rate (ABR) ser- 
vice has been defined. An application us- 
ing ABR specifies a peak cell rate (PCR) 
that it will use and a minimum cell rate 
(MCR) that it requires. The network al- 
locates resources so that all ABR appli- 
cations receive at least their MCR capac- 
ity. Any unused capacity is then shared 
in a fair and controlled fashion among 
all ABR sources. The ABR mechanism 
uses explicit feedback to sources to as- 
sure that capacity is fairly allocated. Any 
capacity not used by ABR sources re- 
mains available for UBR traffic. 


Figure 1: (a) Computing the average; (b) this version doesn't require a 
summation. 





To determine how much capacity to al- 
low to each connection, each ATM switch 
must monitor the traffic through itself. One 
of the techniques recommended by the ATM 
Forum for this purpose uses the equation 
MACRU( )=(1—01)XMACRU—1)+0 X CCRC) for 
each virtual connection through the switch, 
where CCR is a measure of the current cell 
rate on a given connection, and MACR 
(mean allowed cell rate) is an estimate of 
the cell rate for the connection in the next 
sampling period. Typically, @=1/16, so that 
more weight is given to past values of CCR 
than to the current value. Based on the es- 
timates for each connection, if a switch ex- 
periences congestion, it restricts the cell rate 
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of the connections passing through it in a 
manner that is proportional to the estimat- 
ed demand from each connection. 


Real-Time Transport Protocol (RTP) 
Real-time transport protocol (RTP) is de- 
signed to support both point-to-point and 
multicast real-time applications. RTP over- ~e— 0=0.5 
comes three deficiencies in TCP for real- eee 


—s— Simple Average 


time applications: -*— Observation ~2— Simple Average 


—*— Observation 


1234567 8 9101112131415161718 1920 1234567 8 910111213 1415161718 1920 






1. TCP is a point-to-point protocol that 
sets up a connection between two end ee 
points. Therefore, it is not suitable for Figure 4: Comparing (a) simple and (b) exponential averaging. 
multicast distribution. 

2. TCP includes mechanisms for retrans- Only a small weight is given to the most mate. The estimated value of jitter is re- 
mission of lost segments, which then ar- recent observation, so that temporary ported to other participants in the con- 
rive out of order. Such segments are not fluctuations do not invalidate the esti- nection. The jitter measure may provide 
usable in most real-time applications. 

3. TCP contains no convenient mechanism 
for associating timing information with 
segments, which is another real-time re- 
quirement. 
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One key requirement for receivers of 
real-time traffic is to estimate the amount 
of delay variation, or jitter, experienced 
on a connection in order to determine 
buffering requirements. In essence, the 
receiver would like to pass incoming traf- 
fic through a delay buffer that will 
smooth out delay variability so that the 
received data has the same timing char- 
acteristics as the transmitted data. There 
is no simple way to measure this quan- 
tity at the receiver, but it is possible to 
estimate the average jitter in the follow- 
ing way. At a particular receiver, you de- 
fine the following parameters for a giv- 
en source: 
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e SZ) = Timestamp from RTP data packet I. 

e RW) = Time of arrival for RTP data pack- 
et J, expressed in RTP timestamp units. 
The receiver must use the same clock 
frequency (increment interval) as the 
source, but need not synchronize time 
values with the source. 

e DU) = The difference between the in- 
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The value of DW) is calculated as 
DU )=(RUV)—RU-1))-(S)- Sd —-1)). Thus, 
DW) measures how much the spacing be- 
tween arriving packets differs from the 
spacing between transmitted packets. In 
the absence of jitter, the spacings will be 
the same and DW) will have a value of 
0. The interarrival jitter JZ) is calculated 
continuously as each data packet J is re- 
ceived, according to the formula in Fig- 
ure 6. JZ) is calculated as an exponen- 
tial average of observed values of DW). 
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backed by a 30-day unconditional guarantee. Other products 
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a warning of increasing congestion be- 
fore it leads to packet loss. 


Internet Congestion Control 

When routers in a network become con- 
gested to the point of buffer saturation, 
it becomes necessary to discard incom- 
ing or buffered packets. It may be de- 





Figure 5: The Van Jacobson TCP 
retransmission algorithm. 
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sirable for routers to begin to discard 
packets before total saturation occurs. In 
this way, TCP connections receive early 
warning of increased congestion as the 
result of a few lost packets. The TCP 
connections can then back off the vol- 
ume that they generate to avoid more 
catastrophic congestion. The most im- 
portant example of a proactive packet 
discard scheme is Random Early Detec- 
tion ee — nut ~ Floyd. 





Fioure 6: Formula for calculating 
the interarrival jitter JD) continuously 
as each data packet | is received. 





RED has been implemented by a num- 
ber of router vendors. 

In essence, RED monitors the queue 
length for an output port on the router. As- 
sociated with each output buffer are two 
thresholds THmax and THmin. When the 
queue length is less than THmin, no dis- 
card occurs; when it is greater than THmax, 
any arriving packet is discarded. Between 
these two thresholds, an incoming packet 
has an increasing probability of being dis- 
carded as the queue length grows from the 
lower to the higher threshold. The use of 
two thresholds and a sliding probability 
scale affords the opportunity to fine-tune 
the algorithm, and experience has shown 
that it can be effective in anticipating con- 
gestion without wasteful discard. 

However, rather than use the actual 
queue length for the calculation, RED uses 
a smoothed average queue length SQ(K)= 
(1—-W)xSQCK-1)+WxQ(K), where Q is the 
actual queue length, SQ is the smoothed 
average, and W is a weighting factor. 

You might wonder why an average 
queue size is used when it would be sim- 
pler to use the actual queue size. The pur- 
pose of using an average queue size is to 
filter out transient congestion at the router. 
The weight W determines how rapidly SQ 
changes in response to changes in actual 
queue size. Floyd recommends a quite 
small value of 0.002. As a result, SQ lags 
considerably behind changes in actual 
queue size. The use of this small weight 
prevents the algorithm from reacting to 
short bursts of congestion. 


Summary 

Exponential averaging is used in a sur- 
prisingly wide variety of protocols and 
congestion-control algorithms. By vary- 
ing the weighting factor used in the av- 
erage, more or less weight can be given 
to recent observations compared to more 
distant ones. Anyone needing a simple 
technique for making estimates based on 
past observations should be aware of this 
versatile tool. 
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Details of Intel’s 


Virtual 


Mode Extensions (VME) 


Robert R. Collins 


ne and a half years ago, I was con- 
tacted by Intel’s book publisher to 
see if I would consider providing 
technical reviews of Intel’s upcom- 
ing programmer’s reference manual. For 
the preceding 12 months, I had publicly 
complained that the technical quality of 
Intel’s microprocessor manuals had dete- 
riorated to the point that they couldn’t be 
relied upon. Judging by the quality of the 
content, it seemed obvious that nobody 
of technical significance inside of Intel was 
reviewing the manuals (which were be- 
ing written by college interns) before 
dumping them on the engineering com- 
munity. Consequently, when Intel’s pub- 
lishing firm contacted me, I viewed the re- 
quest as a positive indication that Intel 
wanted to rehabilitate its image. I gladly ac- 
cepted the offer, feeling I was being allowed 
to participate in helping to fix Intel’s prob- 
lem, instead of being an antagonist. It was 
akin to voting in an election: Unless you 
participate when asked, you don’t have a 
right to complain about the outcome. 

Upon receiving the manuals, my two pri- 
mary concerns were whether or not they 
were accurate, and whether or not Intel 
was still trying to hide information. As I re- 
viewed the manuscripts, it became clear 
that they were written pretty much like pre- 
vious Intel manuals. They contained some 
inaccuracies and Intel omitted some im- 
portant information. Some of the faulty ar- 
eas concerned information that would ben- 
efit Intel’s competitors. Therefore, it’s hard 
to believe those errors were not intention- 
al. My most important observation didn’t 
concern what was in the manuals— it 
concerned what was missing. 

After my (very) public disclosure of In- 
tel’ Virtual Mode Extensions (VME) in 
November 1995 (see my January 1998 “Un- 
documented Corner” column), I assumed 
Intel would publish all of the missing de- 
tails. Intel had already published some of 
the VME details in its Pentium Pro manu- 
als. However, some of the most important 
details were still omitted. 


Robert is an independent consultant on 


the x86 architecture. He can be reached 
at rcollins@x80.org. 
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Unfortunately, the missing information 
isn’t only important to Intel’s competitors, 
but to every engineer who needs to write 
kernel-level code for Intel processors. In- 
tel’s competitors can easily reverse engineer 
this information; but Joe Engineer doesn’t 
have the time or means to do so— and may 
not know that these features exist. 

Intel still hasn’t released the manuals I 
reviewed, nor has Intel released the re- 
mainder of the VME details. Furthermore, 
the programmer’s reference manual due 
since July 1997 still hasn’t been released. 

In my last column (DD/J, January 1998), 
I began a discussion of VME, mentioning 
that even though Intel had shrouded de- 
tails of VME in a 15-year nondisclosure 
agreement (NDA), publicly available Intel 
manuals included at least 27 references to 
VME. I explained that the original v86 
mode was crippled by poor performance 
when executing legacy DOS applications. 
To increase the performance of DOS box- 
es running in a protected-mode operat- 
ing system (like Windows), Intel created 
VME. Finally, I discussed how VME was 
designed to fix these performance prob- 
lems, and give DOS boxes approximate- 
ly 20 percent better performance than their 
nonVME counterparts. 

In this column, I'll continue this dis- 
cussion of VME, starting with a descrip- 
tion of the various components of VME 
and how they work together. 


Overview of VME Components 
VME support is enabled and disabled by 
setting and clearing the VME bit in Con- 
trol Register 4 (CR4, bit 0). The EFLAGS 
register has been enhanced with two new 
bits: a Virtual Interrupt Flag (VIF), and a 
Virtual Interrupt Pending flag (VIP). When 
CR4.VME=1, the microprocessor uses new 
microcode to process the Interrupt-Flag- 
sensitive (IF-sensitive) instructions CLI, 
STI, PUSHF, POPF, INT-n, and IRET. 
When enabled and running at I/O Per- 
mission Level three GOPL=3), all INT-n in- 
structions are controlled by a new structure 
that resides inside of the Task State Seg- 
ment (TSS). The new structure is called the 
“interrupt redirection bitmap.” Interrupt redi- 
rection always occurs when CR4.VME=1. 





This behavior does not change when 
EFLAGS.IOPL does not equal three 
(IOPL<3). When running at IOPL<3, in ad- 
dition to the enhanced INT-n behavior, IF- 
sensitive instruction are allowed to execute 
without faulting to the Ev86 monitor. 

The TSS has been extended to include 
a 32-byte interrupt redirection bitmap. 32 
bytes are exactly 256 bits (one bit for each 
software interrupt), which can be invoked 
via the INT-n instruction. This bitmap re- 
sides immediately below the I/O permis- 
sion bitmap (see Figure 1). The definition 
of the I/O base field in the TSS is there- 
fore extended and now has dual mean- 
ing. Not only does the I/O base field 
point to the base of the I/O permission 
bitmap, but also to the end (tail) of the 
interrupt redirection bitmap. This struc- 
ture behaves exactly like the I/O per- 
mission bitmap, except that it controls 
software interrupts. Hardware interrupt 
(IRQs) and CPU-generated exceptions are 
not influenced by the interrupt redirection 
bitmap. When an INT-n instruction occurs 
inside of an Enhanced v86 mode task 
(Ev86 task using VME), and its corre- 
sponding bit is set in the interrupt redi- 
rection bitmap, an interrupt will fault to 
the Ev86 monitor— just as a standard v86- 
mode task does in the 80386 and 80486. 
When the interrupt’s corresponding bit is 
clear, the Ev86 task will service the inter- 
rupt without ever leaving Ev86 mode— as 
if the interrupt occurred in native real mode. 


VIF and VIP EFLAGS Bits 
Two new flags—EFLAGS.VIF and 
EFLAGS.VIP— were added to the EFLAGS 
register. These flags are intended for use 
when the IOPL of the Ev86 task is less 
than three (OPL<3). These flags can only 
be purposely modified by the CPL-O (ker- 
nel mode), Ev86 monitor, or an interrupt 
service routine running in kernel mode. 
EFLAGS.VIF is a virtualized version of 
the standard interrupt flag (EFLAGS.IF). 
While the Ev86 task is running, CLI and 
STI instructions will not modify the actu- 
al EFLAGS.IF; instead, these instructions 
modify EFLAGS.VIF. The only exception 
to this rule occurs when EFLAGS.VIP=1. 
When EFLAGS.VIP=1, STI still faults to 
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the Ev86 monitor. The actual value of 
EFLAGS.IF is completely hidden from the 
Ev86 task, since PUSHF, POPF, INT-n, and 
IRET have also been modified to help 
hide the behavior. 

The EFLAGS.VIP flag is a Virtual Inter- 
rupt Pending flag. EFLAGS.VIP can assist 
the Multitasking Operating System (MTOS) 
in sending virtual interrupts to the Ev86 task. 
The easiest way to understand the VIP flag 
is to explain its use in the context of a pro- 
gram running on an 8086. When the 8086 
is in an uninterruptible state (FLAGS.IF=0), 
external interrupts remain pending but don’t 
get serviced. After FLAGS.IF is set (because 
of an STI, POPE, or IRET instruction), the 
pending interrupt is serviced by the CPU. 
EFLAGS.VIF and EFLAGS.VIP are intended 
to serve this same purpose to the MTOS 
running an Ev86 task. Let’s assume your 
Ev86 task was at the same uninterruptible 
point as the previous 8086 example. A 
timer-tick interrupt occurs, and the MTOS 
services the interrupt. During the interrupt 
service routine, the MTOS decides that the 
Ev86 task needs to service this timer tick, 
and sets EFLAGS.VIP=1. After returning, the 
Ev86 task is still in an uninterruptible state 
(EFLAGS.VIF=0). At some later time, the 
Ev86 task attempts to set EFLAGS.IF (via 
STI, POPF, or IRET). When this happens, 
the Ev86 task becomes interruptible, and 
a general-protection fault to the monitor 
immediately occurs (#GP(O)). (There are 
many caveats associated with this behav- 
ior. Those caveats will be discussed in my 
next column.) 


IF-Sensitive Instructions 
To support the new EFLAGS.VIF and 
EFLAGS.VIP flags, changes were needed 





to the instructions that read and write the 
interrupt flag of the EFLAGS register. CLI, 
STI, PUSHF, POPF, INT-n, and IRET all 
had to be changed to support Ev86 mode. 

When an Ev86 task is running at 
IOPL<3, CLI and STI clear and set 
EFLAGS. VIE, instead of faulting to the Ev86 
monitor or affecting EFLAGS.IF. As men- 
tioned earlier, the exception to this rule 
occurs when EFLAGS.VIP=1. 

PUSHF copies the contents of the 
EFLAGS.VIF flag to the FLAGS.IF position 
as it pushes the FLAGS image onto the 
stack. This gives the appearance to the Ev86 
task that STI and CLI are really setting and 
clearing EFLAGS.IF. This appearance is nec- 
essary in case the software attempts to 
check for this condition. Listing One (ist- 
ings begin on page 138) is a code sequence 
that tests for the interrupt flag. In addition 
to moving EFLAGS.VIF to the FLAGS. IF lo- 
cation on the stack image, PUSHF always 
pushes an IOPL image of three onto the 
stack. It is important to remember that the 
Pentium’s IF-sensitive instructions behave 
identically to the Intel486 when IOPL=3, 
even when CR4.VME=1. Therefore, PUSHF 
simulates an IOPL=3 to any software wish- 
ing to read the stack image to determine 
its IOPL. The actual IOPL of the Ev86 task 
never changes during this process. 

POPF works in a similar manner: It 
copies the bit in the FLAGS.IF position to 
EFLAGS.VIF position as it pops the FLAGS 
image from the stack. The Pentium is care- 
ful to make sure that the faked IF and 
IOPL aren’t accidentally copied into the 
real IOPL during the POPF operation. Be- 
fore the FLAGS image is merged into the 
EFLAGS register, the IF image is copied 
to the EFLAGS.VIF slot, and the FLAGS.IF 


Figure 1: Interrupt redirection bit map in the TSS. 
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and FLAGS.IOPL images are cleared. (All 
of the FLAGS register bits are cleared, ex- 
cept the actual IF and IOPL.) Finally, the 
filtered FLAGS image is merged with the 
actual EFLAGS register. A side effect of 
POPF is its handling of the Trap Flag (TF) 
in the FLAGS stack image. If the FLAGS.TF 
bit on the stack image is set, then POPF 
causes a general protection fault before 
any FLAGS values are modified (#GP()). 

The IRET instruction behaves exactly as 
the POPF instruction does with respect to 
EFLAGS.IF, EFLAGS.VIF, and EFLAGS.IOPL. 
IRET and POPF differ in how they handle 
the trap flag from the stack image. If 
FLAGS.TF is set in the FLAGS stack image 
during POPF, a #GP(0) occurs; yet for IRET, 
the #GP does not occur. 

The INT-n instruction is the most com- 
plicated of the IOPL-sensitive instructions. 
INT-n behaves exactly like PUSHF in the 
way it handles EFLAGS.IF, EFLAGS.VIF, 
and EFLAGS.IOPL (provided the interrupt 
instruction has been redirected to the Ev86 
task via the interrupt redirection bitmap). 
However, one of the enhancements to 
Ev86 mode is the ability of the Ev86 task 
to execute software interrupts without 
leaving Ev86 mode. This enhancement has 
been accomplished with the aid of the in- 
terrupt redirection bitmap in the TSS. 
When the corresponding IR bit is set, the 
interrupt will be invoked in exactly the 
same manner as a normal v86 task, by 
faulting to the monitor. When the corre- 
sponding bit is clear, the interrupt handler 
is invoked as if it were executing on an 
8086 processor. In other words, a fault to 
the monitor is never generated, nor is a 
transition to the protected mode interrupt 
handler. The interrupt transition and re- 
turn are done entirely within the Ev86 task. 
The influence of the IR bitmap is best de- 
scribed by the pseudocode in Listing Two. 

To demonstrate how to implement and 
use VME, I’ve included some source-code 
examples (available electronically; see “Re- 
source Center,” page 3) that demonstrate 
how interrupt redirection works with and 
without VME. Table 1 describes the code, 
which demonstrates the following char- 
acteristics of VME: 


e With and without VME (CR4.VME=0 and 
CR4.VME=1). 

e JOPL equals two and three OPL=2 and 
IOPL=3). 

e Various combinations of the descriptor 
privilege-level values in the interrupt 
gate of the global descriptor table (DPL 
of the Interrupt Gate). 

e With and without using interrupt redirec- 
tion (R-bitmap=1 and IR-bitmap=0). 


Conclusion 


The virtual mode extensions are useful 
to memory managers and multitasking 
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operating systems. Memory managers can 
primarily benefit by the use of the in- 
terrupt redirection bitmap to reduce the 
number of switches to and from pro- 
tected mode. This has the added bene- 
fit of reducing the complexity of the in- 
terrupt service routines, as they no longer 
need to reflect software interrupts back 
to the v86 task. 

Multitasking operating systems benefit 
from interrupt redirection and from the 
Virtual interrupt support. The MTOS would 
run with virtual mode extensions enabled, 
and the Ev86 tasks running at IOPL<3. 
This gives an MTOS full benefit of the vir- 
tualization of interrupts. When the MTOS 
wishes to send a virtual interrupt (such as 
a virtual timer tick) to an uninterruptible 
Ev86 task, it will do so by setting 
EFLAGS.VIP=1. When the task becomes 
interruptible, a general protection fault oc- 
curs, and the MTOS will send the virtual 
interrupt to the Ev86 task. This would give 
programs that are timer dependent (such 
as games) a significant performance ad- 
vantage. As an added benefit of using the 
virtualization features of the CPU, even 
more complexity can be removed from 
the Ev86 monitor. The result of using these 
new features is an Ev86 monitor that is 
simpler to implement and maintain than 
its nonEv86 counterpart. 


oh oh oh oh oh CC CD 


Table 1: Source code description. 


As I mentioned, Intel has kept many 
of these details secret. Some parts of VME 
are still undocumented, and will likely 
remain so. All of the CPU microcode al- 
gorithms describing CLI, STI, PUSHF, 
POPF, INT-n, and IRET are not docu- 
mented in any of Intel’s manuals. Even 
though these manuals describe the mi- 
crocode algorithms used by all other in- 
structions, Intel has asserted its right as 
the intellectual property owner by with- 
holding this information. Intel’s com- 
petitors no longer need this informa- 
tion— they’ve already cloned it. Instead, 
withholding this information hurts you 
and me, and every engineer who can 
benefit from this knowledge. 

In my next column, I’ll disclose these 
algorithms and discuss the many caveats 
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of Intel’s VME implementation. I’ll show 
how microcode has been added to im- 
plement VME. During that discussion, I’ll 
show an interesting bug in Intel’s micro- 
code implementation. Accompanying the 
discussion, I’ll demonstrate how much in- 
formation about VME was haphazardly 
left in the manuals after Intel tried to re- 
move it when creating the infamous Ap- 
pendix-H (see my November 1997 column 
for a description of Appendix-H). By ex- 
tracting these quotes and rearranging 
them, I'll let Intel tell its own story about 
why VME was needed and how it was im- 
plemented. All this comes free for the read- 
ing, without an NDA. 


DDJ 
(Listings begin on page 138.) 
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_  € PROGRAMMING 


Listing One 


//[ ------ tstios@1.cpp 
#include <fstream> 
#include <iostream> 
#include <string> 
#include <vector> 
#include <iterator> 
#include <cstdlib> 


const char* fname = "tstiostr.txt"; 

int main() 

{ 
std::ofstream ofile(fname) ; 
std: :ostream_iterator<long> iter(ofile, ""); 
// ------ vector of pseudorandom numbers 
std::vector<long> rno(19@) ; 
for (int i = 9; i < 100; i++) 

rno[i] = rand(); 

/[ ------- write the data 
std::copy(rno.begin(), rno.end(), iter); 
ofile.close(); 
return @; 


Listing Two 


/[ ------ tstios@1.cpp 
#include <fstream> 
#include <iostream> 
#include <string> 
#include <vector> 
#include <iterator> 


const char* fname = "tstiostr.txt"; 

main () 

{ 
std::vector<long> rno(1@@) ; 
std::ifstream infile(fname) ; 
/[ ----- read vector of pseudorandom numbers 
std::istream_iterator<long> in(infile) ; 
std::istream_iterator<long> end; 
std::copy(in, end, rno.begin()); 
[[ ==----- display the input numbers 
for (int i = 9; i < 100; i++) 

Btdsscout, << tnoli) << * *% 

return Q; 


eed 


Listing Three 


#include <vector> 

#include <typeinfo> 

// --- base persistent allocator class 
template <class T> 

class PAllocator : public std::allocator<T> { 


protected: 
PAllocator(const char* filename) ; 
TP eas 
}; 
// --- specialized persistent allocator class 
class IntAllocator : public PAllocator<int> { 
public: 
IntAllocator() : PAllocator<int>(typeid(*this) .name() ) 
‘ 2 
¥3 
int main() 
{ 
std::vector<int, IntAllocator> pints; 
// ... 
return @; 


ed 
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Listing One 


/** Defines the methods available to the client for remote invocation */ 
package mypackage.rmiexamples; 
public interface QServerInterface extends java.rmi.Remote { 
public void addQuote(String quote) throws java.rmi.RemoteException; 
public void setQClientInterface(QClientInterface c) 
throws java.rmi.RemoteException; 


Listing Two 


/** Defines methods available to server for remote invocation as callbacks */ 
package mypackage.rmiexamples; 
public interface QClientInterface extends java.rmi.Remote { 

public void refreshClient (String q) throws java.rmi.RemoteException; 


} 


Listing Three 
package mypackage.rmiexamples; 


import java.rmi.*; 
import java.rmi.server.UnicastRemoteObject; 
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mypackage.rmiexamples.QClientInterface; 
mypackage.rmiexamples.QServerInterface; 
java.util.*; 
java.rmi.registry.LocateRegistry; 


implements QServerInterface, Runnable { 
Vector clientele; //tracks all connected clients 
QClientInterface myQClientInterface; 
Vector quoteList; //maintains updatable list of quotes 
QClientInterface myClientObj; 
Thread clientThread=null1; 
static int counter=0; 


public QuoteServer () throws java.rmi.RemoteException { 


} 


super (); 

// create and initialize the quote list 

quoteList = new Vector(); 

quoteList.addElement ("Climb mountains to see lowlands"); 
quoteList.addElement ("If you want the rainbow, put up with the rain"); 
gquoteList.addElement ("Smooth seas do not make skilful sailors"); 
clientele = new Vector(); 


public void setQClientInterface(QClientInterface c) 


} 


throws RemoteException { 
synchronized (clientele) { 
clientele.addElement(c); 
} 
if (clientThread == null) { 
clientThread = new Thread(this, "clientThread"); 
clientThread.start(); 
} 


private void doIt() { 


} 


synchronized (clientele) { 
Vector backup = (Vector) clientele.clone() ; 
int seed; 
while ((seed = new Random().nextInt()) <=0) ; 
seed = seed % quoteList.size(); 
String data = quoteList.elementAt(seed)+ ": "+ ++counter; 
for (int i=@; i < clientele.size(); i++) { 
myClientObj = (QClientInterface) clientele.elementAt (i); 
try { 
//update the client asynchronously via callback 
myClientObj.refreshClient((String) data ); 
} catch (RemoteException e) { 
System.out.println("client must have disconnected!"); 
//get rid of remote reference for disconnected client 
backup. removeElement (myClient0bj) ; 
if (backup.size() <= @) { 
//no more clients- so stop server thread 
clientele = (Vector) backup.clone(); 
Thread dummy = clientThread; 
clientThread = null; 
dummy. stop(); 


} 
} 
clientele = (Vector) backup.clone() ; 
} //end syncronization on clientele 


public void run() 


{ 


} 


while (true) { 
try { 
//sleep for a second 
Thread. currentThread() .sleep(18@@) ; 
} catch (Exception e) { 
} 
doIt(); 
} 


public void addQuote(String quote) throws RemoteException { 


} 


synchronized (quoteList) { 
//update quote list 
quoteList.addElement (quote) ; 
} 


public static void main(String[] args) { 


System. setSecurityManager (new RMISecurityManager()); 
try { 
System.out.println("QuoteServer.main: creating registry"); 
LocateRegistry.createRegistry (1099) ; 
System.out.println("QuoteServer.main: creating server"); 
QuoteServer myQuoteServer=new QuoteServer() ; 
System.out.println("QuoteServer.main: binding server "); 
Naming.rebind("/QuoteServer", myQuoteServer) ; 
System.out.println("QuoteServer bound in registry"); 
} catch (Exception e) { 
System.out.println("Exception on binding QuoteServer") ; 
System.out.println(e.toString()); 
} 


Listing Four 


package mypackage.rmiexamples; 


import 
import 
import 
import 
import 
import 
import 
public 


mypackage.rmiexamples.QServerInterface; 

mypackage. rmiexamples.QClientInterface; 

java.rmi.*; 

java.rmi.server.*; 

java.util.Random; 

java.awt.*; 

java.applet.*; 

class QuoteApplet extends Applet implements QClientInterface { 


(continued on page 138) 
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(continued from page 137) 
public QServerInterface srvQuote=null; 


//the client uses jdk1.1 event handling mechanism 
void sendQuote_Clicked(java.awt.event.ActionEvent event) { 
try { 
//update the remote server object with a new quote 
srvQuote. addQuote( (String) myQuote. getText ()) ; 
} catch (Exception e) { 
System.out.println(e.toString()); 
} 
} 
public void init() { 
super.init(); 
try { 
//by exporting the client object, turn it into a 
//remote object 
UnicastRemoteObject.exportObject (this) ; 
} catch (Exception e) { 
System.out.println("Could not export client remote object"); 
System.out.println(e) ; 


try { 
//obtain the remote server object's reference by interrogating the 
//rmi registry 
srvQuote = (QServerInterface) 
Naming. lookup ("//"+getCodeBase() . getHost ()+"/QuoteServer") ; 
srvQuote.setQClientInterface(this) ; 
} catch (Exception e) { 
System.out.println("Could not set client interface at server") ; 
System.out.println(e.toString()); 


public void addNotify() { 


//don't worry about the specifics of the 
//following code. All it does is create a GUI 
//for the rmi client 
super.addNotify(); 
setLayout (null) ; 
resize(1016,565) ; 
labell = new java.awt.Label("Input Quote:") ; 
label1.reshape(21,24,72,21); 
add(label1) ; 
myQuote = new java.awt.TextField(); 
myQuote.reshape(104,24,242,21); 
add (myQuote) ; 
sendQuote = new java.awt.Button("Send") ; 
sendQuote. reshape (357,24,48,21) ; 
add(sendQuote) ; 
quoteList = new java.awt.TextArea() ; 
add (quoteList) ; 
quoteList.reshape(36,60,371,125) ; 
//register listners 
Action lAction = new Action(); 
sendQuote.addActionListener (lAction) ; 
} 
java.awt.Label label1; 
java.awt.TextField myQuote; 
java.awt.Button sendQuote; 
java.awt.TextArea quoteList; 
public void refreshClient (String q) throws java.rmi.RemoteException { 
//this is the callback method implementation 
quoteList.append(qt"\n") ; 
} 
class Action implements java.awt.event.ActionListener { 
public void actionPerformed(java.awt.event.ActionEvent event) { 
Object object = event.getSource() ; 
if (object == sendQuote) 
sendQuote_Clicked (event) ; 


} 


UNDOCUMENTED CORNER 


® e 
Listing One 
STI ; Enable interrupts 
PUSHF ; Store FLAGS on stack 
POP AX ; Restore flags into register 
TEST AX, 20@h ; Interrupt flag set? 
Jec label ; Jump on condition 
e e 
Listing Two 


N = INTERRUPT_NUMBER; 
INTERRUPT_BIT_MAP_ PTR = TSS_BASE->IO_PERMISSION_BASE - 32; 
IF INTERRUPT_BIT_MAP_PTR->BIT_NUMBER [N] 
IF (IOPL<3) 
#GP (@) ; 
ELSE 
GOTO INT-FROM-V86-MODE; 
ELSE 
INVOKE_REAL_MODE_STYLE_INTERRUPT_FROM_Ev86_TASK(N) ; 
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Learning ISAPI 


Jeff Cromwell 


ne test for any “how to” book is the 

author’s ability to convey informa- 

tion in such a manner that you can 

easily digest difficult material and 
transfer that knowledge to solve your own 
software-development problems. Although 
a single book is not a panacea to what 
ails software developers, books do serve 
as important instruments in the building 
of the mental tools that all programmers 
need and desire. One such tool that I 
needed was the ability to interface a 
database, server, and web site on Win- 
dows NT 4.0. Although many technolo- 
gies exist to do this, my situation de- 
manded the use of the Microsoft Internet 
Server Application Interface (ISAPI) to 
communicate between these applications. 
In addition to this, I wanted the ability to 
develop programs that can manipulate 
data from the traffic of my company web 
site. Technically, I wanted to perform fil- 
ter control of the site. 

Being unfamiliar with ISAPI, I turned 
to the three books examined here to get 
the essential tools to accomplish the task. 
The first, designated as the most com- 
plete reference, is entitled Using ISAPI, 
and written by Stephen Genusa et al. The 
other two books are more specific and 
deal with the subject matter in the con- 
fines of a specific language. For exam- 
ple, Michael Tracy’s Professional Visual 
C++ ISAPI Programming and ISAPI with 
Visual Basic 5, by Wayne S. Freeze and 
Tim Ritchie are self explanatory. I'll first 
examine the general book by Genusa, 
then move on to the other two. 


Using ISAPI 

Using ISAPI, by Stephen Genusa et al., is 
readable and delivers on its promise to 
teach you what the API is, how it can best 
be used, and to provide a reference. It is 
broken down in typical ISAPI fashion: You 


Jeff is a software engineer for Levi, Ray, and 


Shoup Inc. in St. Louis, Missouri. He can be 
contacted at jcromwell@Irs.com. 
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first paced through brief nels on ie 
Web, NT, and ISAPI. Next comes a dis- 
cussion on the pros and cons of ISAPI us- 
age versus CGI, and an introduction into 
the meat of the book— including a sub- 
stantial discussion on ISAPI extensions and 
filters. The book concludes with advanced 
topics with respect to debugging and 
threading. 

Using ISAPI provides an excellent 
treatment on filters and includes a well- 
developed tutorial that monitors the HTTP 
communication event process. Consider- 
able space is devoted to the issue of the 
information flow in the filter as well as 
ISAPI filter rules. Some of the ISAPI filter 
applications that can port to a real-world 
setting are the custom authentication and 
logging filters. The authors provide an ex- 
cellent treatment of these applications, 
which, at times, can be demanding from 
a technical standpoint. 

While Using ISAPI provides a detailed 
treatment of individual topics mixed with 


nttp://www.ddj.com 


Electronic Review of 
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a seminal application in both C++ and Vi- 
sual Basic, I found it limiting: Reference 
books tend to be unsuitable for quickly 
transmitting an author’s knowledge to 
solve a reader’s specific problems. The 
book’s breadth and richness take consid- 
erable time to digest. I would say that it 
is a necessary addition to my ISAPI library, 
and that the Que books are technically 
sound. However, they often require con- 
siderable effort to follow along and un- 
derstand. The sample applications (“Hel- 
lo World” and “guestbook”) do provide 
some quick instructional coding examples. 
Still, most of the work is focused on a dis- 
cussion of the API and associated param- 
eters. This is necessary, of course, for ref- 
erence, but it hinders readability and 
consistency. If you have time to generate 
sample programs and learn overall con- 
ceptual development, and you desire to 
know the intricacies of ISAPI program- 
ming, you need this book. Otherwise, do 
not expect to use this book right away to 
solve specific problems like interfacing 
databases to web applications. 


Programming ISAPI with Visual Basic 5 
Moving from the general to the specific, 
Programming ISAPI with Visual Basic 5, 
by Wayne S. Freeze and Tim Ritchie, de- 
livers on the promise of showing you how 
to effectively program the ISAPI and effi- 
ciently use ActiveX controls in the devel- 
opment of web pages. Freeze and Ritchie 
build an integrated example application 
called “WebMaster,” which provides in- 
formation on important groupware con- 
cepts such as mail, discussions, and cal- 
endaring. While it is entirely a “how-to” 
book, it does offer good referential treat- 
ment of the OLE ISAPI 2 interface. If you 
are new to the OLE ISAPI 2 interface, this 
book demonstrates how to use a Visual 
Basic program with a web server as an 
ISAPI extension. The rationale behind this 
is to allow the creation of server-side 
ActiveX applications for Microsoft’s Inter- 
net Information Server (IIS). Freeze and 
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(continued from page 141) 

Ritchie quickly state some of the limita- 
tions of using OLEISAPI2, which (as men- 
tioned before) does not let you write IS- 
API filters or allow multiple thread activity. 
The authors go over the basics of objects, 
properties, and methods; and the re- 
mainder of the book is devoted to the 
WebMaster application, which is similar 
to applications like Lotus Notes and Mi- 
crosoft Exchange. 

While there are many potential prob- 
lems with an integrated application ap- 
proach to code instruction, the authors do 
an excellent job of remaining consistent 
throughout the book and adhering to their 
approach. The time and effort you will 
put into the coding example will give you 
the feeling of accomplishment, and their 
systematic code instruction (along with 
sample screen snapshots) lets you repli- 
cate their work easily. If you are seeking 
a solution to a particular problem, how- 
ever, the book’s approach is at times im- 
practical. 

If you have the time, Programming 
ISAPI with Visual Basic 5 is an excellent 
book. The author’s code is well designed 
with both substantial and suitable com- 
ments that provide considerable aid in the 
learning process. The main drawback of 
the book has more to do with the OLE 
ISAPI than with the author’s work. How- 
ever, if you find that going through the 
code of an entire application is the best 
way you learn interface concepts, then 
this is a necessary item for your bookshelf. 


Professional Visual C++ 

ISAPI Programming 

Michael Tracy’s Professional Visual C++ 
ISAPI Programming promises to teach 
you how to program real-world, indus- 
trial-strength ISAPI extensions, and it de- 
livers. For example, I was able to take the 
content provided in Chapter 4; code, test, 
and adapt it to meet my needs; while, at 
the same time, learning the concepts. The 
book is not a reference, but Tracy pro- 
vides invaluable tips and strategies with- 
in the context of the application devel- 
opment. Although it is half the size of the 
other two books, Professional Visual C++ 
ISAPI Programming provides immediate 
access to easily extend the author’s sam- 
ple code. The section on e-mailing form 
data not only teaches you the API, but 
also provides a strategy for those engaged 
in sockets programming. I admit my par- 
tiality to C++, but no matter what the lan- 
guage, Professional Visual C++ ISAPI Pro- 
gramming easily conveys difficult 
concepts in the code. In addition, the 
sample ISAPI debugger included with the 
book is great, and I have used it suc- 
cessfully in debugging my own ISAPI ex- 
tensions. Tracy provides the complete 
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source code, so you can dwell deeper 
into substantive issues about ISAPI pro- 
gramming. One drawback is that the book 
can be, at times, terse, but this is proba- 
bly due to the writing style of the author 
and the length of the book. 

I found the treatment on the multi- 
threading issues, coupled with Tracy’s il- 
lustration on the pitfalls of synchronized 
threading in ISAPI, extremely important. 
Tracy does not provide a detailed treat- 
ment of thread-synchronization objects 
(such as critical sections, mutexes, 
semaphores, and events), but does pro- 
vide the necessary details to use these in 
the context of database application writ- 
ing. In the context of the ISAPI filter, he 
develops the same User Authentication al- 
luded to in Using ISAPI. However, em- 
phasis on both the cache manager and 
the ODBC API provide considerable in- 
sight, and the ready-to-use code can eas- 
ily be implemented in current develop- 
ment projects. 

This book does presume that you have 
some well-developed tools already at your 
disposal (knowledge of both Microsoft’s 
Internet Information Server and MFC, for 
instance). If you want to build applica- 
tions through real-world examples in Vi- 
sual C++ with ISAPI, then this is the book 
for you. 


Conclusion 

All three books would be good additions 
to the ISAPI programmer’s resource tool- 
kit. Apart from the CD-ROMs that come 
with two of the books (Genusa and Freeze 
books), each book fulfills a different need. 


_ Genusa’s book provides an overall refer- 


ence work with a complete description of 
ISAPI, along with some tutorials illustrat- 
ed in both Visual Basic and Visual C++. 
Its main drawback is that it is a typical ref- 
erence book, and must be utilized in that 
way. The other two books are quintessen- 
tial “how-to” books: Freeze relies on us- 
ing the integrated application approach to 
convey VB and ISAPI programming; while 
Tracy demonstrates the use of VC++ with 
ISAPI in the context of real-world appli- 
cations. Both teach you about the neces- 
sary tools within their respective contexts. 
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Competitive Automation’s JOIN 4.1 IP ad- 
dress management software consists of 
two integrated IP management compo- 
nents for UNIX servers, JOIN DHCP and 
JOIN DDNS, and provides improved func- 
tionality, security, and license manage- 
ment. Other changes include an upgrade 
to release 2.3.7 of the Berkeley-DB, which 
manages name and configuration infor- 
mation. JOIN Server Release 4.1 is avail- 
able for Solaris, HPUX, and Digital UNIX. 
Prices start at $1700.00 per server. 
Competitive Automation Inc. 

1050 University Drive, Suite 210 

Menlo Park, CA 94025 

650-321-4006 

http://www join.com/ 


Transparent Language has announced the 
TranscendRT SDK for developing and/or 
incorporating machine translation capa- 
bilities into new or existing applications. 
Typical uses of the TranscendRT SDK in- 
clude adding translation capabilities to in- 
tranet servers, e-mail software, word pro- 
cessors, on-line chat tools, OCR products, 
and the like. The TranscendRT SDK con- 
tains all the necessary DLLs and libraries 
to add full text translation to any applica- 
tion. Products can be equipped to per- 
form bidirectional English to/from Span- 
ish, French, and German translation, and 
unidirectional translations from English to 
Portuguese or Italian. The TranscendRT 
SDK requires a development environment 
that supports Microsoft Visual C++ 5 or 
comparable programming language. Ap- 
plications developed with the SDK will 
run on Windows 95/NT. 

Transparent Language 

22 Proctor Hill Road 

Hollis, NH 03049-0575 

603-465-2230 


http://www.transparent.com/ 


Sequiter Software has released CodeBase 
6.3 for Windows CE, a version of the 
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CodeBase database engine designed 
specifically to run under the limited mem- 
ory constraints of hardware running Win- 
dows CE. CodeBase 6.3 is xBASE com- 
patible, is portable between platforms, and 
supports Java, C, C++, Visual Basic, and 
Delphi. CodeBase programs can be de- 
ployed as single-user, multiuser, or 
client/server applications, without code 
changes or recompilation. Full source code 
of the library is included. 

Sequiter Software 

P.O; BOX 7305 

Greenland, NH 03840 

403-437-2410 

http://www.sequiter.com/ 


Current Network Technologies Corp. 
(CNTC) has announced Version 2.0 of its 
PeerDirect data replication system. PeerDi- 
rect 2.0 can distribute and replicate data 
between different database platforms, us- 
ing ODBC for data access. This release 
features configuration wizards, improved 
performance, a tutorial, and a sample ap- 
plication for assisting developers on dis- 
tributing data across multiple platforms. 
Operating on Windows 95/NT, retail pric- 
ing for PeerDirect starts at $295.00 per 
replication site. 

Current Network Technologies Corp. 
2095 North Sheridan Way, Suite 150 
Mississauga, ON, Canada L5K 2N6 
416-805-9088 

http://www.cntc.com/ 


Tetradyne Software’s SourceView ActiveX 
Control Version 1.0 is a customizable syn- 
tax highlighting text editor component. 
The SourceView ActiveX control can be 
customized for coloring any language syn- 
tax. In most cases, customization can be 
accomplished by setting control proper- 
ties. For more complex syntax require- 
ments, OLE interface hooks are provid- 
ed to plug in a parsing implementation 
developed in any language. Visual Basic, 
Delphi, and C++ examples are provided 
for coloring of C, Pascal, Basic, and HTML 
syntax. Additionally, the SourceView Ac- 
tiveX Control includes a document-view 
architecture, undo/redo, Unicode sup- 
port, and printing support. The Source- 
View ActiveX control sells for $299.00. 
The control include a single developer li- 
cense and provides for unlimited redis- 
tribution with applications. Multidevel- 
oper discounts are also available. The 
sourceView control supports Windows 
95/NT. No additional run-time DLLs are 
required. 

Tetradyne Software Inc. 

2542 South Bascom Avenue, Suite 206 
Campbell, CA 95008 

408-377-6367 


_ http://www.tetradyne.com/ 





Frontier Technologies has announced the 
e-Lock Toolkit Version 2, a package de- 
signed for integrating secure messaging, 
security management, and IP security with 
applications. The e-Lock v.2 Toolkit is 
based upon standards including Secure 
Multipurpose Internet Mail (S/MIME), Pub- 
lic Key Cryptography Standards (PKCS), 
and IP Security (IPSEC). The e-Lock Se- 
cure Messaging Toolkit is a modular, stan- 
dards-based secure messaging toolkit, 
which developers can use to build secure 
capabilities into any type of messaging ap- 
plication. The e-Lock PKI Client Toolkit 
provides an interface into the key and cer- 
tificate management of the e-Lock v.2 pub- 
lic key infrastructure. The e-Lock VPN 
Toolkit provides IPSEC functionality via 
the Authentication Header (AH) and En- 
capsulated Security Protocol (ESP) for IP 
Security. 

Also included in the e-Lock v.2 suite 
are the e-Lock Desktop and the e-Lock 
Director, which support digital signatures, 
encryption, and public key infrastructure 
(PKI). e-Lock Desktop is a desktop ap- 
plication that protects documents and files 
by digitally signing and/or encrypting 
them. e-Lock Director allows security ad- 
ministrators to maintain and manage the 
e-Lock Public Key Infrastructure. Security 
administrators may use the e-Lock Direc- 
tor for managing user registration, key and 
certificate maintenance, public-key direc- 
tory, certificate revocation, certificate val- 
idation, automated key backup, key re- 
covery, and automated key portability 
services. The e-Lock Toolkit v.2 costs 
$1000.00. 

Frontier Technologies Corp. 
1489 W. Oak Shadows Drive 
Oro Valley, AZ 85737 
520-797-0583 
http://www.frontiertech.com/ 


iNet Developer 3.0 for Windows 95/NT is 
a web-site development tool from Picto- 
rius. It includes a site and page editor, an 
application server, and integrated pro- 
gramming environment. Pictorius iNet De- 
veloper 3.0 includes support for COM, De- 
sign Time Controls (DTC), Dynamic 
HTML, and data binding. Pictorius iNet 
Developer 3.0 costs $1495.00. 

Pictorius Inc. 

2000 Barrington Street 

Halifax, NS, Canada B3J 3K1 
902-492-2880 

http://www.pictorius.com/ 


SciTech Software has released SciTech Dis- 
play Doctor 6.0, a software utility for 
graphics cards. SciTech Display Doctor 6.0 
includes automatic graphics chip detec- 
tion, display centering and refresh rate 
control, display power management, and 
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diagnostic tools (such as compatibility test- 
ing and performance profiling). The util- 
ity supports 3-D stereo LCD glasses, and 
VESA VBE 3.0 and VBE/AF 2.0 on over 
250 different SVGA graphics chips. The 
new version includes support for the forth- 
coming VESA VBE 3.0 standard, the VESA 
Generalized Timing Formula (GTP), a pre- 
view of the VESA VBE/SCI (Serial Control 
Interface) specification, and a preview of 
the VESA VBE/AF 2.0 (Accelerator Func- 
tions) specification. SciTech Display Doc- 
tor 6.0 sells for $44.95. 

SciTech Software Inc. 

505 Wall Street 

Chico, CA 95928-5624 

530-894-8400 
http://www.scitechsoft.com/ 


ParaGraph PI Technology released Cal- 
liGrapher 5.0 for Windows CE, a hand- 
writing recognition engine for Windows 
CE. CalliGrapher users can write infor- 
mation directly on screen. Version 5.0 can 
recognize a mixture of cursive and print- 
ed lettering in any orientation, and it also 
supports deferred recognition. CalliGra- 
pher 5.0 is available for a suggested retail 
price of $49.95. 

ParaGraph PI Technology 

2011 N. Shoreline Boulevard, MS 571 
Mountain View, CA 94043 

650-933-3000 
http://www.paragraph.com/ 


Sheridan Software’s Data Widgets 3.0 is 
an ActiveX data grid control that can be 
printed directly to paper or published on 
the web. Other 3.0 features include a 
masked edit capability for easy data vali- 
dation, a custom grid designer, and full 
Intellimouse support. Data Widgets 3.0 
costs $295.00; upgrades are available for 
$129.00. 

Sheridan Software Systems Inc. 

35 Pinelawn Road, Suite 206E 

Melville, NY 11747 

516-753-0985 

http://www.shersoft.com/ 


Microsoft has announced its DirectAni- 
mation API, the animation component 
of the DirectX Media 5.1 SDK. Direct- 
Animation is a media integration and 
run-time API that lets you spiff up web 
pages and applications with 2D and 
3D animation, user interactivity, and in- 
tegration of text, video and audio me- 
dia types. DirectAnimation is included 
in the minimal installation of Internet 
Explorer 4.0 and is available for the Win- 
dows 95/NT 4.0. 

DirectAnimation features include: a con- 
sistent interface for dealing with 2D vec- 
tor graphics and sprites, 3D geometry, text, 
video, and audio media types; uniform 
time/event model that enables coordinat- 
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ed interaction among various media types; 
multimedia controls for setting motion 
paths, sequencing, sprite controls, and low- 
bandwidth vector graphics; and integra- 
tion with Dynamic HTML and Window- 
less Control support. 

The DirectAnimation SDK is available 
on the DirectX Media 5.1 SDK via CD- 
ROM or at http://www.microsoft.com/ 
directx/. 

Microsoft Corp. 

One Microsoft Way 
Redmond, WA 98052 
425-882-8080 
http://www.microsoft.com/ 


Mark Kilgard has released Version 3.6 
of GLUT, a freely available OpenGL util- 
ity toolkit source-code distribution. 
GLUT provides a portable windowing 
API for OpenGL programmers with C, 
C++, Fortran, and Ada program bind- 
ings. GLUT programs can run on 
X/UNIX workstations; Linux PCs using 
Mesa/X; and PCs running Windows 
95/NT. New features include perfor- 
mance optimizations to the Win32 GLUT 
library, improved Win32 GLUT library 
conformance, inclusion of Linas Vep- 
stas’s GLE Tubing and Extrusions Library, 
and many new GLUT-based OpenGL ex- 
amples. GLUT 3.6 is available at http:// 
reality.sgi.com/mjk/glut3/. 


Objectivity for Java from Objectivity is a 
Java-language interface for Objectivi- 
ty/DB, a distributed object database 
management system. Objectivity for Java 
is a multithreaded Java binding imple- 
mentation to support sophisticated Java 
applications. The product employs a sin- 
gle process model to improve perfor- 
mance of Java-based applications. Ob- 
jectivity for Java provides a language 
binding compliant with the ODMG 2.0 
standard and compatibility with most vi- 
sual Application Development Environ- 
ments (ADEs), including standard tools 
for debugging. Objectivity for Java is 
packaged with Objectivity/DB and is 
available for the Windows NT and Sun 
Solaris operating systems. Developer li- 
cense pricing ranges between $8000.00 
and $12,000.00, depending on the op- 
tions ordered. Run-time pricing is vari- 
able, depending on the application. 
Objectivity Inc. 

301B East Evelyn Avenue 

Mountain View, CA 94041 

650-254-7100 
http://www.objectivity.com/ 


Pennington Systems has announced 
XTRAN, an expert system for manipulat- 
ing computer languages, for Year 2000 im- 
pact analysis and reengineering of a wide 
variety of languages, including assemblers, 





C/C++, Fortran, Cobol, Pascal, PL/I, PL/M, 
and a number of proprietary languages. 
XTRAN’s rules language, called “meta- 
code,” provides access to details of the 
processed code, including statements, ex- 
pressions, symbols, even comments. Rules 
can be written to analyze code for Year 
2000 impact, and to automate many code 
changes required for Year 2000 compli- 
ance after analysis has identified the re- 
quired changes. Pennington licenses its 
XTRAN expert system based on the size 
of the code body to be processed, and 
also provides training and assistance in 
the use of XTRAN, as well as general con- 
sulting in software engineering. 
Pennington Systems Inc. 

916 Commons Way 

Princeton, NJ 08540 

609-924-8130 
http://www.pennington.com/ 


Xceed Software has released Xceed Zip 
Compression Library 3.0 and Xceed Zip 
Self-Extractor Module 1.0. Xceed Zip Com- 
press Library 3.0 provides components 
that let developers add zip and unzip 
functionality to their applications. Version 
3.0 includes memory compression, glob- 
al status reports, and file renaming. This 
release includes a Borland C++Builder 
component along with the VBX, OCX, 
and Delphi VCLs. Xceed Zip Compres- 
sion Library costs $199.95. The Self-Ex- 
tractor Module is an add-on module that 
allows creation of fully customizable 16- 
and 32-bit Windows self-extracting zip 
files. Xceed Self-Extractor Module 1.0 costs 
$199.95, or $99.95 when purchased with 
the Zip Compression Library. 

Xceed Software Inc. 

2001 De La Métropole, Bureau 705 
Longueuil, PQ, Canada J4G 189 
514-442-2626 
http://www.xceedsoft.com/ 


Three D Graphics has introduced Per- 
spective JavaChart, a charting program 
that offers a Java Class Library, JavaBean, 
and preconfigured Java 1.1 applet for cre- 
ating charts directly on the web. The pro- 
gram has a full set of properties, meth- 
ods, and user-interface tools for creating 
data-driven graphics. A licensing fee for 
Perspective JavaChart is $995.00 per serv- 
er for executable code. Source code is 
available for developers for an addition- 
al $995.00. 

Three D Graphics 

1801 Avenue of the Stars, Suite 600 

Los Angeles, CA 90067 

310-599-3915 
http://www.threedgraphics.com/ 
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FREE Three-Day Admission to the Product Exhibition 
March 31 - April 2 


Keynote Address: Jack Ganssle 
2001-A Development Odyssey 
Tuesday, March 31 6:00 pm - 7:00 pm 

Special Guest Lecture: Ned Barnholt - Hewlett Packard 
The “Consumerization” of Digital Systems 
Wednesday, April 1 6:00 pm - 7:00 pm 

Opening Night Reception: “A Taste of Chicago” 
Tuesday, March 31 7:00 pm - 8:00 pm 

Motorola Reception 
Wednesday, April 1 7:00 pm 
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It’s not too late to register for exhibiting, making our show-floor the perfect place to find all the 
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Supports 
Windows 95 & 
Windows NT 


SERIAL |/O 


The ONLY Serial Communication Libraries, DLLs, & Tools 
You Will Ever Need For Windows 3.x, Windows 95, 
Windows NT, and MS-DOS. Accept No Less Than The Best. 


http://www.wcscnet.com/home.htm 


COMM-DRYV/LIB™ 
Professional Serial I/O Libraries & DLLs for 
Windows 3.x, Windows 95, NT, & MSDOS 


Supports ALL languages, tools, or applications that can 
call the Windows API. 
© Complete source code to all libraries & DLLs. 
© Same API for Windows 3x, Windows 95, NT, & MS-DOS. 
© High level Hayes compatible modem functions. 
© X, Y, Zmodem file transfers on multiple ports. 


Asynchronous & timed callback to user functions on 

© different serial communication events(modem signals, 
buffer counts, character reception, and more). 

e Supports Visual C/C++, ANSI C/C++, QuickBasic, 
Visual Basic (MSDOS & Windows), Assembly, Access, 


© Transmit data from user callback on any event.. 


© Extensive scanning of input character stream. 


¢ Support most intelligent/all dumb multiport cards(Arnet, AST, 


Boca, Cyclades, Digiboard, GTEK, & many more). 


MS-DOS Serial I/O TSRs & Utilities 
ge ime DOS device driver that allows serial ports to be opened 
like files under MS-DOS & Windows. 
FOSSIL interface for all supported(Supports all BBS, Mailers, 
etc.[PCBoard, Wildcat, Frontdoor, Doorway, Renegade, etc.]) 
@Provides Int!4h, Int2ih, & DAM interfaces. 
®Compatible with Desqview and Windows. 
Real time serial port monitoring to screen and disk. 
©Mini-BBS for file transfer/Spawnable file transfer engine. 
@Supports most intelligent & non-intelligent multiport cards. 


@®@02@@00 @ 


COMM-DRV/VxD™ 
Ultra High Speed Serial I/O VxD For Windows 


Windows(95 & 3.x) DLLs with API calls into the 32bit VxD. 
MS-DOS high speed libraries for calling VxDs from a DOS box. 
All serial communication interrupts at ring 0 in 32 bit mode. 

Uses transmit & receive FIFOs(16450, 16550, 16650, etc.). 

Win32, Win16, Console, & MS-DOS apps active concurrently. 
Over 400K baud under Windows 95 & Windows 3.x!!! 

Built-in Multidrop Protocol & Nine-Bit Protocol(RS485 apps.etc).. 
Compatible with Visual C/C++, Visual Basic, and many more tools. 


Features Supported ByAll Products 


© Remap/Change baud rates/divisor. 

® Product Customization. 

® Hardware/Software Flow Control. 

© Unlimited number of ports active concurrently. 


© 8250/16450/16550 Auto detection/Up to 400 KBaud. 
© 100% Port re-entrant code(Important for multitasking). 
© Support all standard dumb multiport cards. 


WCSC Price List 


COMM-DRV/Lib (DOS ,Win3x,Win95,NT)... 
COMM-DRV/Lib (Competitive Upgrade) .. 
COMM-DRV/VxD 

COMM-DRV/Dos 

4Port Card(16550/8 bits) 

4Port Card(16550/16 bits) 

8Port Card(16550/16 bits) 

8Port Card(16550/16 bits/Ext.Box).. 
Professional Combo (Lib, VxD) $ 
Software Combo(Lib,VxD, Dos) 
Complete Combo (Sftwe+4Port (16550) . .$499. 


4425 Kingwood Dr. Suite 21, Kingwood, TX 77339 
1(800)966-4832 (281)498-4832 


(Fax) (281)568-3334 (BBS) (281)568-6401 
Internet: sales@wescnet.com 


Visa/Mastercard/Discover/AMEX/Checks/Approved P.O. WWW: http://www.wescnet.com/home.htm 
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m »Add Zip/Unzip power to your programs 


DynaZIP  , Royalty-Free, unlimited distribution 
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What does it take to get Microsoft Certified? 


We Can Help! 


Attaining Microsoft Certification is no easy task, But the professional and 
financial rewards are worth it. Transcender exam simulations give you a 
realistic preview of the nature and difficulty of Microsoft certification 
exams. Our exams show you where you need to focus your studies and 
give you detailed SxpInngiions of every answer to every alice 


ion't Pass Guarantee.* 


Ww! ' developer pak $499 
feeds Items 1,2, & choose TWO from 3-7 listed below) 


1; WinArch-l-Cert™ 2.0 (Windows Architecture |) - $149 
2. WinArch-ll-Cert™ 2.0 (Windows Architecture Il) - $149 
3. VB-Cert™ 5.0 (visual Basic 5.0) - $149 

4. VB-Cert™ 4.0 (Visual Basic 4.0) - $129 

5; AccessCert™ 1.0 (Access 2.0) - $129 

6. AccessCert™ a: 0 (Access 7.0) ~ $129 

7. SQL-Cert™ 6.5 (SQL 6.5 Implementation) - $149 


To order, call 615-726-8779, FAX 615-726-8884 


VISA/MC/AMEX/DISC/MO/COD — Add $4 s&h ($25 outside the US) 
ONLINE ORDERING! 


VW W A *See our Web site for conditions. 
et Cc e 
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Transcender® 


Corporation 
242 Louise Ave. 
Nashville, TN 37203 
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»Converts zip files to Windows programs 
»Royalty-Free, unlimited distribution 
»>Makes 16 and 32 bit executables 
»OCX/DLL/VCL and Database interfaces 
»>Includes Wizard, many other tools 


Inner Media, Inc. (800) 962-2949, (603) 465-3216 
Fax: (603) 465-7195 Email: Sales@innermedia.com 


alii OMA AWANAlalat=liaal=cellemexe)an | [800.454.5502 
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ALL PRODUCTS 
INCLUDE COMPLETE 
SOURCE CODE, 
ARE ROYALTY FREE 
AND AVAILABLE FOR 
1b OR 32 BITS. 
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D TE Edit Control (Advanced RTF control) 
BD HTML Viewer/Editor Add-on for TE 

D ReportEase Plus (report writer engine) 
BD SpellTime DLL and dictionary 

D FormPlus (form designer/filler) 

D Rich Text Grid control and ChartPro 





Demos: www.subsystems.com 





SUB SYSTEMS, INC. 
11 Tiger Row, Georgetown, MA 01833 
978-352-9020 Fax: 978-352-9019 
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work Class Library | 


Create high performance VxD and NT driver modules from common 
source with the Tetradyne DFC library. The DFC driver wizard gets you 
started quickly and the DFC classes simplify common driver tasks. 


Free DFC training courses (including free evaluation copy) held in San 
| Jose, CA. Drop by www.tetradyne.com for more information. 


2542 S. Bascom Ave., Suite 206 
Campbell, CA 95008 

(408) 377-6367 FAX: (408) 377-6258 
sales@tetradyne.com 
www.tetradyne.com 
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Interactive Disassembler Pro 


interactive auto-commenting disassembler 
NOW WITH STANDARD LIBRARIES RECOGNITION ! 
background analysis, unlimited size of input file 
EXE, NE, LE, PE, LX, NLM, Binary, ROM and more 
80x86, Pentium , Pentium Pro, MMX, Z80, 8051, 6502, 
68K, 680X, JAVA and more - still $199 + s&h! 


Fast Library Intelligent Recognition Technology 
relegates classical disassembler technology into 
prehistory. Your disassemblies suddenly come alive 
and familiar symbols spring in sight. Get our free 
evaluation from 

www.datarescue.com or www.CCso.com 
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toCode 


“In Clouds to Code | document the design and 
implementation of a real project, from start to 
finish, and I hide nothing. . . . Along the way you'll 
learn about object-oriented analysis and design 


and the UML, as well as COM, computer telepho- 
ny, C++, and design patterns. This is program- 
ming in the trenches.” -Jesse Liberty 


Author, Jesse Liberty 1-861000-95-2,$40 Available 
at bookstores coast-to-coast visit the Wrox Press 
web site-www.wrox.com for a full TOC and 
sample chapter. 1-800-USE-WROX 
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Next Generation 
goo@! Copy Protection 


Customer Satisfaction Guaranteed 
http://www.wibu.com 


WIBU-SYSTEMS introduces the future 

of copy protection. The new WIBU- 

BOX/U combines the unmatched 

effectiveness of hardware based copy (¢) CeBIT 98 

protection with the 100% trouble free HANNOVER 
‘ : : MARCH 19 —- 25, 1998 

operation of the Universal Serial Bus, Hal! 13 Booth B29 

giving you and your cutomers 

something to cheer about. 


GRIFFIN 


Call now for your free Test Kit 


(800) 986 6578 
— PTBCHNOLOG!I E 5 


International 

WIBU-SYSTEMS AG 

Rueppurrer Strasse 54 

North and South America: D-76137 Karlsruhe 


Griffin Technologies, LLC Tel.: +49-721-93172-0 


1617 St. Andrews Drive, Lawrence, KS 66047 Fax: +49-721-93172-22 SYSTE MS 


Tel.: (785) 832-2070 - Fax.: (785) 832-8787 Email: info@wibu.com 
Email: sales@griftech.com - www.griftech.com —http://www.wibu.com Quality the World Trusts 









html++ ,New. 


CGI Class Library 


Compatible with all internet web servers 
Generate interactive web pages in C++ 
Ideal for webitying databases 
No more Perl or scripting 
Automates CGI, cookies, forms, state 





Win32, Win16, OS/2, DOS, Unix, Mac 
FREE DEMO Call 1-800-775-1073 


Tel (606) 245-4175 
Fax (606) 245-9305 
www.dcmicro.com 
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DC Micro 
a> Development 


COMPRESSION DEVELOPMENT TOOLS FORTRAN 
Zip and Unzip from your apps with the : 


The best cross-platform Td XCEED ZIP |20P WR tie Compe Fo 


compression libraries S 
for Win32, Win16, COMPRESSION LIBRARY Create sophisticated state-of-the-art user interfaces for your FORTRAN 
programs. Spindrift’s callable subroutines and functions let you 


DOS, OS/2, Unix, e Easy to use VBX, OCX and VCLs © Have data entry screens, dialog boxes, scrolling list boxes 
Macintosh, and e Fast, compact and reliable pull-down menus, push buttons, help panels, etc. All with full 
- : : mouse support. 
embedded Syste: e Over 40 Zip and Unzip functions! e Much more! 281 subroutines and functions for keyboard, 
e All 16 & 32-bit controls for $199.95 


screen, mouse, DOS control, and hardware status. 
New self-extractor add-on for Xceed Zip lets your Comprehensive demonstration program shows what you can do and how to do it. 
apps create customized 16/32-bit self-extracting 


Now shipping Version 3.0 with all new User Guide. 
Zip files. Only $99 with purchase of Xceed Zip! 16 Bit Compilers - $149 32 Bit Compilers - $289 


Robust 45-function API compresses 
buffers, files, archives, disk spanning, 
encryption, self-extr. EXE's and more. 


FREE DEMO Call 1-800-775-1073 
DC Micro Tel (606) 245-4175 


Fax (606) 245-9305 
<> Development www.demicro.com 
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‘Ic O M INC. 


A Device Driver Technology Company 


PRICE: 


Get the fully functional free trial version! 
www.xceedsoft.com/dobbs 


Xceed Software 1-800-865-2626 1-514-442-2626 
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8 Tanglewood Path « Galena ¢ IL 61036 ¢ USA 
Phone: (815) 777-8240 e¢ FAX: (815) 777-8241] 
E-mail: dgable@galenalink.com 
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GRAPHICS LIBRARIES 


SciTech MGL 
Now FREE with Full Source Code 


Device Drivers 
System Software 
Embedded 


Systems Software 
Networking/Protocol Software 


TCOM is a software technology consulting company specializing in 


device drivers and embedded systems software development. 


Operating Systems: 
HP-UX, VxWorks, pSOS 


HW Platforms: x86, RISC, DSP, ISA, PCI, PCMCIA, VME 


EMail: s.andrews@tcom-inc.com 
Web: hittp://www.tcom-inc.com 
Voice: 973-539-1777; ask for Steve 
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DEVELOPMENT TOOLS 
PARSING JUST GOT EASIER 


Parsing input used to mean tricky logic with lots 
of places for bugs to hide, slipped schedules, 
difficult maintenance. Nobody’s idea of fun. 


Now, more powerful, more convenient, 
AnaGram V1.5 changes all that. You can take 
charge and concentrate on the real problems. 
Expressions? Database queries? Scripts? 
Command languages? No problem. Call for free 
trial copy. 


Anagram by Parsifal Software 
P.O. Box 219, Wayland, MA 01778 


CIS: 72603,1763 info @ parsifalsoft.com 
(800) 879- 2577 Voice/Fax (508) 358-2564 


AD LINK 506 


Network Software 
Development Toolkits 


Win32 NDIS Framework (WinDis32) - Supports 
development of Win32 applications that directly 
access network adapters. 


TDI Client Samples - Use TCP/IP from device drivers. 


Printing Communications Assoc., Inc. 
4201 Brunswick Court 
Smyrna, GA 30080 
(770) 432-4580 


hag >Y/AY AANA mh OFe Urea Wrerey en 
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Internet Version Control System by 


Reliable Software 


Smart Tools for Smart Programmers 


http://www .relisoft.com info @relisoft.com 


Ltelephon 
Win NT, Win 95, SCO, IRIX, UnixWare, Solaris, AIX, 


code - and you won’t | . 
need to change you 
- commenting style! 


YOXLID: siennony  GdfGoge |OolS 
Custom Programming: VB, C, Access, 


800-447-9120 x1334 


Dept 1334, 100 Park Avenue, Suite 360, Rockville MD 20850 
Intl:+1-301-424-3903 Fax:301-762-8185 BBS:301-762-8184 


www.teratech.com/dqj/ 
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DOCUMENTATION TOOLS 7 


- Produce HTML, 


You can fine- 
tune your output 
with DocJet’s 
WYSIWYG 
output editor. 


{SHelp, and 

1S Word 
_ documentation from { 

omments in your 
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> New -Version 2.1. 
~ ages ape » Generates documen- 
tation. directly from the B 
source code. 
» Extracts comments. 
p> User customized 


reports formats. 
> HTML, WinHelp, 
RTF 


» FREE workin 
evaluation a 
www.bbeesoft.com 


os 
2) 4 


1-888-646-1933 


Bumble Bee Software 
P.O. Box 2007 
Westford, MA 01886 


info@bbeesoft.com 





a http://www.aics.edu h 


e Earn B.S. and M.S. in Computer Science 
° Object oriented B.S. program AMERICAN 
¢ New courses in Java, Networking, HTML, MIS 
¢ Approved by more than 275 companies COMPUTER 
¢ Follows ACM/IEEE guidelines SCIENCES 


¢ Thousands of students throughout U.S. gy ACCREDITED 
MEMBER 


Free catalogue 1-800-767-AICS Wold 
or http://www.aics.edu. of Universities 


and Colleges 


SciTech MGL is a complete graphics library for Windows 95/3.1/NT 
&DOS that has been used to develop leading titles like WinQuake® 
and Hexen /®. SciTech MGL can be ported to other OS’s in as few 
as 1000 lines of code. Includes: OpenGL® API support; sprite 
library, Game Framework; hardware triple buffering; support for 
stereo LCD shutter glasses; automatic detection & utilization of VGA, 
Modex, VESA VBE, VBE/AF, WinG, CreateDIBSection & DirectDraw; 
and more. Supports standard C/C++ compilers & Borland Delphi. 

For more information call (530) 894-8400. 
Download SciTech MGL with full source code from our website at: 


www.scitechsoft.com 
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VICTOR 


Image Processing Library 


Fast BMP, TIFF, PCX, GIF, TGA, PNG, JPEG. Adjust 
brightness, contrast, sharpen, create filters, resize, rotate, 
+more of single image, multiple images, or any image area; 
color reduction to optimum, specific, or std. palette; print; 
scan; crop, combine, compare, blend images. 


DOS $199, 16-bit DLL $299, 32-bit DLL $499 


Catenary Systems 
314-962-7833/fax: 314-962-8037 
www.catenary.com/victor 
ask for free demo src avail visa/mc/c.o.d. 


PROLOG TOOLS 


Add Rule-Based : 
Components . 


Diagnose, advise, configure and plan | 
with the Amzi!® LogicServer™ tools & ; 
libraries (DLLs) for C/C#, Java, VB, 
Delphi, Web Servers & more. Win NT ° 
95 3.x & Solaris. Use ODBC, Sockets, = 
Unicode & new OOP extensions. 
FREE Evaluation Version! 
Amz. ine. Emailinto@amzi.com 
-44i +1-513-425-8050 Fax 425-8025 # 
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SCRIPTING LANGUAGES 


CGI scripting is a piece of oom 
cake with Winductor2.1 <—_—, 


@ Use e-mail, FTP, HTTP, 
HTML-parsing and file 1/0 
directly from the scripts 


© Script and orchestrate 
other Windows programs 


@ Easy-to-follow CGI tutorial 


Windu 


Desiderata Software 


WWW. deSISOFT.COM 2 Veron $1.4551,Framingham,MA01701 


Easy and clear syntax 
Native to Windows 
Integrated Heuer —- 
can be debugged wit 
actual POSTed data from 
browser 


Download your free 
evaluation copy today 








CrypKey Software Licensing System 
“Software protection with NO hardware lock and NO disk key ” 


CrypKey is software copy protection that is: 

* completely secure from any disk copy program 

° perfect for CD-ROM or INTERNET distribution! 

* cost effective, user friendly, and 100% guaranteed to satisfy! 


CrypKey can increase your software sales: 
upsell options and levels of your software 
¢ lease or demo your software by runs or time 


° enable or upgrade your customers instantly by phone, fax or email! 


New! unique Ready-To-Try features upon install allows 1 trial period only per customer. 
New! unique Add-On feature-add more options, levels, runs or time to existing licenses. 
New! CrypKey Instant-protects in just 5 minutes with no source code changes. 

CrypKey is completely compatible with MS-DOS, MS-Windows 3.x, Win32s, Win95, Win NT, 


and manages network licenses on all Novell and Microsoft operating system based networks. 
CrypKey is produced by Kenonic Controls Ltd.—engineering and software since 1972. 


Kenonic Controls Limited 
7175 - 12th Street South East 
Calgary, Alberta, Canada T2H 256 
(403) 258-6200 fax: (403) 258-6201 
INTERNET: crypkey@kenonic.com 
WEB: http://www.kenonic.com/crypkey.html 
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w-cost HASP 


www.aks.com 


Fax: 212 564-3377, | 
Int’l Office Tel: + 
Fax: +972 3 537-5796 


ALADDIN 


asp.sales@aks.com 
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The Professional’s Choice 


It's The Ultimate 


Solution To Piracy! 


Call for your FREE guide today! 
800-841-1316 * www.softsec.com 


SOFTWARE @RAINBOW 
SECURITY 


A Rainbow Technologies Company 


The industry founder & the 
world leader have joined forces 
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BugCollector Pro 2.0" 


Puts Bugs In Their Place 


ee and Feature Request Tracking Database 
Tracks 
bugs from initial report through fina! resolution 

Manages 

features from first request through implementation 
Organizes 

everything you need to create a top-quality product 

Sorts ¢ Filters * Reports * Graphs « Exports 
your software development data the way you want it 


Neshitt Software Corporation 
Download a FREE 30-day BugCollector trial from http:/www.nesbitt.com/ 
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Reilelekemeeiive is Security 


Cost effective products for all 

levels of software protection 

and authentication. 

@ CRYPTO-BOX® brand dongles, 
SmartCard, PC Cards, and 
extension cards 


@ Software based metering systems 
@ ActiveX, OCX & VCL components 


1-800-627-9468 


SOFTWARE SECURITY SYSTEMS _ 20 Executive Park West, Suite 2027, Atlanta, GA 30329 








C and C++ DOCUMENTATION TOOLS (v. 7.0) 


e €-CALL ($69) Graphic-tree of caller/called function hierarchy, cross-reference, 
file/function index. 

¢ C-CMT ($69) Creates/inserts/updates comment-blocks (functions/identifiers 
used) for each function. 

e C-METRIE ($59) Calculates path complexity, counts lines with comments, code, 
'C’ statements. 

e G-LIST ($69) Lists and action-diagrams, or reformats source into user-selected 
standard formats. 

 C-REF ($69) Creates cross-reference of local/global/define/parameter identifiers. 





e €-DOC ($199) PACKAGE All 5 programs integrated as DOS program. <10,000 
lines. C-BROWSE Windows graphic-tree viewer. 


e €-DOC Professional ($299) DOS, Windows, 0S/2, 1,000,000+ lines. 
e NEW VER 7.0! WEB HTML REPORTS! 


SOFTWARE BLACKSMITHS INC. email @ swbs.com 
6064 St Ives Way, Mississauga _— Voice/Fax (905) 858-4466 
ONT Canada L5N-4M1 http://www.swbs.com 
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Ur. Jobo s tectronic Heview of Computer Books! 


Independent reviews of 
technical computer books 


Written BY developers 


FOR developers 





Check tt Out! 


www .ddj.com 
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Executive Summaries and Summary Executions 


T.. month, I review two software releases, a CD-ROM, and a legal strategy: 


Windows 95: The PC Versions 
All versions of the Windows graphical user interface from Microsoft are PC versions, but two 
recently offered versions are PC in a new sense— they’re politically correct. 

Windows Lite has all the elements of Internet Explorer stripped out to make it compliant with Judge 
Jackson’s order to Microsoft not to force OEMs to accept Explorer if they want Windows. Windows 
Classic, also offered in compliance with the court order, is a two-year old version of the GUL 

Features: Low marks for both. Internet access really ought to be integrated into any OS or GUI. 

Performance: Classic gets an Adequate here; Lite rates Below Average, since it doesn’t boot. 
However, this has led to an increase in the punctuality of mail sorting here at Stately Swaine 
Manor, due to the inability of the mailroom staff, on whose machine we installed this version, to 
play computer games all morning. 

Ease of use: For Classic, Adequate. For Lite, Not Applicable, since it doesn’t boot. 

Documentation: For both, Excellent. Perfectly captures the spirit of adequacy that pervades 
the product. The failure to boot is clearly covered in a tech note included with the Lite version. 

Executive Summary: Why do I say they’re PC? Because Microsoft chose not to discriminate 
against old and crippled (I mean senior and functionally challenged) software. Three cheers! 

Rating: Four dingbats each. 


Encarta: The Encyclopedia 

This is a huge book, a monumental book, or would be if it weren’t a CD-ROM. The only way to 
review a work of this almost encyclopedic breadth (and length) is to see what it says about a 
subject you know something about. I know something about Bill Gates. 

According to The New Yorker, “after Microsoft bought the Funk & Wagnalls encyclopedia and 
turned it into...Encarta...the entry on Bill Gates changed.” The clause “known as a tough 
competitor who seems to value winning in a competitive environment over money” was changed to 
read, “known for his personal and corporate contributions to charity and educational organizations.” 

Features: Filters block access to questionable topics like antitrust law. 

Performance: Slow. It took way too long to make that change in the Gates entry. 

Ease of use: Haven't used it. 

Documentation: Okay, if you get The New Yorker. 

Executive Summary: Even Bill’s wife and child wouldn't say he was known for his charitable 
contributions. Well, I don’t think they would. Let’s call and check. One ringy-dingy, two ringy- 
“Oh, hi. This is Mike Swaine of Dr. Dobb’s Journal and I was just wondering if you would say 
that Bill was known for his charitable contributions. Uh-huh. A cheap skinflint, you say? Really. 
And how about mommy, does she feel that way too? Uh-huh. Okay, bye-bye.” 

Rating: Two and a half dingbats. 


Microsoft's Legal Strategy: The Titanic 
I think that when a company hires a corporate attorney named “Nuke ’em,” it is making a 
statement about the strategies it will consider in its future legal dealings. 

Features: Suits and other legal actions against the company from other software vendors, the 
U.S. Federal government, nine States, and the European Union. 

Performance: Microsoft has managed to annoy or enrage the U.S. Justice Department, the judge 
trying its case, its OEM partners, the press, and the rest of the software industry. 

Ease of use: Spitting in the face of a Federal judge who holds the company’s future in his 
hands probably makes that judge’s job a little easier for him. 

Documentation: Tons of it. Whoever first called these documents briefs had a fine sense of irony. 

Executive Summary: An excellent strategy if you value winning in a competitive environment 


over money. Most stockholders prefer money. 
' 


Rating: At least three dingbats. 
Michael Swaine 


editor-at-large 
mswaine@cruzio.com 


Dr. Dobb’s Journal, March 1998 


eS 
Se 


oo 


Discover CodeBase 6.3—and get the fastest and smallest 
database engine available, plus xBASE file compatibility! 


Award-Winning performance 5 years in a row! 


_ 





~ Query a million records in just 0.97 of a second, 
or add ten thousand records in just 2.17 seconds. 





Discover these valuable benefits of CodeBase: 


¢ Works with C, C++, Visual Basic, 
Delphi, Java and soon OLE DB 


¢ New ActiveX data-aware controls 
help reduce development times 


¢ Runs under Windows 95, NT, 3.1, CE, DOS, Mac, ; 
OS/2, Solaris, SunOS, AIX, SCO, Linux, UnixWare, - Michael Plasterer, Director of Develooment, 
DEC, Alpha, BSDI... symantec 


¢ FoxPro, Clipper and dBASE file support, 
including full multi-user compatibility 


“CodeBase gives ACT! the fast database access 
contact management users need” 








yea sson 





segue 


sah the dat angen and tern 18S 
. e 
— | i EE Test Drive 


Test drive the new CodeBase 6 
for 30 days with your own code. 
No risk. No obligation. 

Order yours today. 


Call: 403-437-2410 


SEQUITER | Fax: 403-436-2999 


SOFTWARE INC. Email: info@sequiter.com 
The Databas e En gine for P A 0g rammers P.O. Box 783 Greenland NH 03840 Web site: www.seduiter.com 






¢ Small library size helps applications load 
quickly and use less memory 





¢ Fully scalable from stand-alone to 
client/server, without code changes 


¢ Unlimited-seat client/server included 
¢ Transaction processing and logging 

¢ Royalty free distribution 

¢ Full featured report writer included 





©1997 Seauiter Software, Inc. All rights reserved. CodeBase and Seaquiter are trademarks of Sequiter Software Inc. All other product names are trademarks of their respective companies. 
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The leader in Java IDEs announces two powerful new 
solutions for corporate application development. 
















, Visual design, templates and i 
Both include: Osta tT 
Extensive support for JDK1.1 100% pure Java code generation Tia | a 
and JavaBeans Comprehensive Java and Z 


JavaBeans libraries, include JFC™ Web authoring environments 








A complete arsenal of state-of- Everything in our Professional Buy now and get 2, , 
the-art Java technologies. Development Edition plus: = nk LO / 
hoe a FREE T-shirt* | / 
Compilation of pure Java code Database aware JavaBeans, ne 2 
to native x86 Wizards and templates = 
Incremental debugging dbANYWHERE Server (100% 
Ability to work with objects, JDBC Compliant™ middleware) 
packages or files Native support for Informix, Sybase, 
JDK1.02-to-JDK1.1 conversion Oracle and MS SOL Server 1-888-647-4614 HSMO198 
Advanced project management Sybase SQL Anywhere The next generation 
capabilities Ability to connect to over 30 of Java Tools 


databases through ODBC 


Symantec and the Symantec logo are U.S. registered trademarks and Symantec Visual Café is a trademark of Symantec : an 

SYMANTEC Corporation. Java and all Java-based trademarks and gos are trademarks or registered trademarks of Sun Microsystems in AR 
bl, the U.S. and other countries. Microsoft, Windows, the Windows logo and Windows NT are registered trademarks of Microsoft ee .0er~ 

Corporation. Other brands and products are trademarks of their respective holder/s ©1998 Symantec Corporation. Mac OS wictoset 


All rights reserved. In Canada, call 1-800-365-8641. In Australia, call 02-9850-1000. In Europe. call 31-71-535-3111. 
*Offer expires 3-31-98 or as long as supplies last 
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